AFCRL-69-0256 




INVESTIGATION  OF  FACTORS  AFFECTING 
THE  QUALITY  OF  VOCODER  SPEECH 


by 


Thomas H. Crystal

SIGNATRON, Inc., 594 Marrett Road, Lexington, Massachusetts 02173


Contract No. F19628-67-C-0292

Project  No.  4610 
Task  No.  461002 
Unit  No.  46100201 



FINAL  REPORT 

Period  Covered:  April  15,  1967  through  May  17,  1969 

May  17,  1969 


Contract  Monitor:  Caldwell  P.  Smith 

Data  Sciences  Laboratory 


Distribution  of  this  document  is  unlimited.  It  may  be 
released  to  the  Clearinghouse,  Department  of  Commerce, 
for  sale  to  the  general  public. 


Prepared 

for 

AIR  FORCE  CAMBRIDGE  RESEARCH  LABORATORIES 
OFFICE  OF  AEROSPACE  RESEARCH 
UNITED  STATES  AIR  FORCE 
BEDFORD,  MASSACHUSETTS  01730 


Reproduced by the

CLEARINGHOUSE
for Federal Scientific & Technical
Information, Springfield, Va. 22151


Qualified  requestors  may  obtain  additional  copies  from  the 
Defense  Documentation  Center.  All  others  should  apply  to  the 
Clearinghouse  for  Federal  Scientific  and  Technical  Information. 


ABSTRACT 


Research into and the development of instrumentation
for the investigation of factors affecting the quality
of vocoded speech are documented. The work reported was
specifically concerned with developing a better
understanding of the role of the vocal source in the
production both of synthetic speech and of natural speech.
The design of and operating instructions for the
VOTIF vocal tract inverse filter, built as part
of the program, are presented. A theoretical
determination of the interaction between the
vocal source and vocoder channel filters has
been made, and the effect of spectrum flattening
on the peak factor and power of a vocoder channel
has been computed. Lastly, the pulsed excitation
of resonances is discussed. A form of
pitch jitter which could either maximize vocal
output or minimize vocal tract impedance effects
is reported on.




FOREWORD 



This  report  describes  research  and  instrumentation 
development  activities  undertaken  by  SIGNATRON,  Inc.  of 
Lexington,  Massachusetts  to  investigate  factors  in  both 
natural  and  synthetic  speech  which  could  influence  the 
quality  of  vocoded  speech.  These  activities  were  carried 
out  under  Contract  No.  F19628-67-C-0292,  beginning  April 
15,  1967  and  ending  May  7,  1969.  The  monitor  of  the 
contract  was  Mr.  Caldwell  P.  Smith,  CRBS,  Air  Force 
Cambridge Research Laboratories at Bedford, Massachusetts.

Dr.  Thomas  H.  Crystal  of  SIGNATRON  was  project  director  and 
principal  investigator. 

Many people other than the author of this report
contributed to this program. Charles L. Jackson and
Yogindiran Amarasingham participated in the assembly and
testing of the VOTIF vocal tract inverse filter. Donald S.
Arnstein participated in the calculation of the effects of
pitch jitter. The staff of Design Automation of Lexington,
Massachusetts (through a subcontract) designed and constructed
the VOTIF filtering units to SIGNATRON specifications. They
also prepared the appendix to this report in which the design
and operation of the filtering units are described.



TABLE  OF  CONTENTS 


Section

I    INTRODUCTION
     1.1  VOTIF Instrumentation
     1.2  Theoretical Investigations
          1.2.1  Source-System Interaction in Channel Vocoders
          1.2.2  Pulsing of Resonators

II   INVERSE FILTERING WITH VOTIF
     2.1  Background
     2.2  Design Considerations
          2.2.1  Performance Specifications
          2.2.2  Other Design Considerations
     2.3  Use of VOTIF
          2.3.1  Planned Use on Speech
          2.3.2  Use of VOTIF on Synthetic Signals

III  SOURCE SYSTEM INTERACTION IN THE CHANNEL VOCODER
     3.1  The Effect of Pitch Rate on Channel Filter Output
     3.2  The Effect of Spectrum Flattening on the Synthesized Signal

IV   PULSING OF RESONATORS
     4.1  Periodic Pulsing of a Resonator
     4.2  Alternate Pulsing of a Resonator

References

Appendix A  Instruction Manual for VOTIF Filtering Units



LIST  OF  ILLUSTRATIONS 


Figure

1.1  Tuning Range of Frequency and Bandwidth Control Settings
2.1  Cancellation of VOTIF Resonance by VOTIF Null with 1 msec,
     100 pps pulse input
2.2  VOTIF Analysis of Two-Formant Synthetic Speech
3.1  Model of Single Channel of Spectrum Flattening Synthesizer
3.2  Effect of Spectrum Flattening on Channel Power
3.3  Effect of Spectrum Flattening on Peak Factor
4.1  Transmission of Components by a Resonance
4.2  Harmonic Oscillator Behavior
4.3  Harmonic Oscillator Response
4.4  Model for Generation of Alternating Period Pulses
4.5  Effect of Jitter on Component Amplitudes
4.6  Response power for alternated and constant period pulses
     exciting a resonator of F = 300 Hz, BW = 50 Hz
4.7  Response power for alternated and constant period pulses
     exciting a resonator of F = 500 Hz, BW = 50 Hz
4.8  Response power for alternated and constant period pulses
     exciting a resonator of F = 700 Hz, BW = 50 Hz



I.  INTRODUCTION


This document reports on research and development done to
investigate factors affecting the quality of vocoded speech.
The work reported on was specifically concerned with developing
a better understanding of the role of the vocal source in the
production of both natural and synthetic speech.
The major part of the work was the development of instrumentation
for performing experimental work in this area. Some theoretical
investigations were also carried out.

1.1  VOTIF Instrumentation

The  instrumentation  developed  has  been  designated  as  VOTIF 
for  Vocal  Tract  Inverse  Filter.  VOTIF  consists  of  a  multi-unit 
analog  filtering  instrument  and  associated  display  and  monitoring 
equipment. The filtering instrument is a cascade of units of two
types. Nulls, or anti-resonances, are used to cancel vocal tract
resonances, or formants. A resonance is used to cancel the vocal
tract anti-resonance that is introduced, along with an additional
resonance, by coupling of the oral cavity with the nasal cavity.

VOTIF  presently  contains  five  operationally  identical  null 
units  and  one  resonance  unit.  The  frequencies  and  bandwidths  of 
each  unit  are  adjustable  over  the  range  shown  in  Figure  1.1.  Both 
the  frequency  and  the  bandwidth  of  each  unit  may  be  set  to  a 
precision  of  within  0.5%  of  the  frequency  value.  The  readings 
obtained  are  within  ±2%  and  ±10%  of  the  actual  frequency  and 
bandwidth, respectively. Over a frequency range from 100 Hz to
10 kHz, the transfer function is accurate to within ±0.25 dB of
magnitude and ±0.10 milliseconds of delay. Full specifications and
operating instructions for the filtering units are given in
Appendix A of this report. These specifications, which were
developed by SIGNATRON, are discussed in Section II. The display
and monitoring equipment consists of a dual trace oscilloscope, a
camera for the oscilloscope and a multi-function meter for checking
signal levels, power supply levels and circuit resistance.


[Figure 1.1  Tuning Range of Frequency and Bandwidth Control Settings]


1.2  Theoretical Investigations


The theoretical researches done under this program all fall
into the general area of Source-System Interaction. Such
interaction exists both in the human and in synthetic speech systems.
In synthetic speech systems it may exist in either or both the
analyzer and the synthesizer. By source we refer to the vocal
cords, in the human, or pitch generator, in a synthesizer (hiss
excitation is not being considered). By system, we refer to the
spectrum-shaping part of the production system. In the human,
this is the vocal tract; in the synthesizer, the variable gain
filters or the adjustable resonators. For convenience we will
assume that the effect of glottal pulse shape is part of the
system.

Previous consideration of source-system interaction has led
to the improvement of channel vocoder speech through spectrum
flattening, to debates on the origin of the residual ripple in inverse
filtered speech, and to theoretical consideration of vocal source
frequency optimized according to the tuning of the vocal tract (House,
1959). This program's consideration of source-system interaction
was made in two areas. First, we considered source-system
interaction in the channel vocoder. Secondly, we considered the
excitation of resonators by periodic pulses.

1.2.1  Source-System  Interaction  in  Channel  Vocoders 

Source-system interaction in the channel vocoder results
because the energy in any one of the analysis or synthesis bands is
a function of the pitch frequency and pulse shape as well as the
transfer function of the vocal tract. According to standard
vocoder design techniques, this interaction is accepted in the
analysis and compensated for in the synthesis by spectrum
flattening. This procedure appears to work very well but is open to some
questioning on theoretical grounds. The results of our
investigations indicate that this compensation procedure should not
generally be criticized because the order of the measured errors
appears sufficiently low. Nevertheless we feel the questions
discussed below were worth asking.
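
As a rough illustration of why the energy in one channel depends on pitch rate, the sketch below counts the harmonics of a pulse source that fall inside a single idealized channel band. The band edges (300 to 450 Hz), the rectangular filter shape and the equal-power harmonics are assumptions chosen for illustration; none of them is taken from this report.

```python
import math


def harmonics_in_band(f0_hz, band_lo_hz, band_hi_hz):
    """Return the harmonics of f0_hz that lie in [band_lo_hz, band_hi_hz)."""
    k = max(math.ceil(band_lo_hz / f0_hz), 1)  # first harmonic at or above the lower edge
    freqs = []
    while k * f0_hz < band_hi_hz:
        freqs.append(k * f0_hz)
        k += 1
    return freqs


def channel_level_db(f0_hz, band_lo_hz, band_hi_hz):
    """Channel power in dB relative to one harmonic (assumes unit-power harmonics)."""
    n = len(harmonics_in_band(f0_hz, band_lo_hz, band_hi_hz))
    return 10.0 * math.log10(n) if n > 0 else float("-inf")


if __name__ == "__main__":
    # Hypothetical channel from 300 to 450 Hz, swept over a few pitch rates.
    for f0 in (100, 120, 140):
        print(f0, harmonics_in_band(f0, 300, 450), channel_level_db(f0, 300, 450))
```

Under these assumptions a 100 Hz pitch places two harmonics in the band and a 140 Hz pitch places one, so the channel level shifts by about 3 dB even though the vocal tract transfer function is unchanged.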


The first question relates to the digital encoding of the
measured channel outputs of the analyzer. This encoding involves
quantization of the analog measurements and, in more sophisticated
systems such as pattern matching vocoders, statistical reduction
of the patterns. The question thus arises as to whether the
quantization of spectrum information, as affected by pitch rate
information which is also transmitted independently, seriously
degrades the digital specification of the system information. In
other words, would the quantization and transmission benefit from
removal of the pitch rate information? For the pattern matching
vocoder, we might also inquire if the pitch rate information, which
is superimposed on the system information, appreciably increases
the number of patterns which must be processed. In an attempt to
clarify the question, the first part of Section 3 presents a
determination of the amount of interaction. In terms of the 4 dB
quantization steps commonly used in vocoder measurements, the effect
appears not to be too serious, but such a determination is more
properly made from actual trials rather than the theoretical
considerations presented here. A doubt about this conclusion persists
because, if the pitch rate were actually to have no effect on the
analysis, spectrum flattening would not be needed at the synthesizer.

The second question raised pertains to the effect, on the
synthesized speech waveform, of the spectrum flattening method
commonly used. This method is the infinite clipping of the source
signal after it has been filtered by one of a pair of channel
filters for the channel. From theoretical considerations, it will be
shown that this approach, in the worst case calculated, corrects
the spectrum to within 2-5 dB of the desired power level. This
is the expected effect of spectrum flattening. Less appreciated
is the fact that spectrum flattening does not seriously distort
the peak-factor of the signal. As will be shown below, the worst
case calculated displays a peak-factor error of less than 2 dB.
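
The level-removing property of infinite clipping can be sketched in a few lines. The example below is only an illustration under stated assumptions: the 400 Hz sinusoid stands in for the output of a channel filter, and the 10 kHz sampling rate and the two amplitudes are hypothetical values, not parameters from this report.

```python
import math


def infinite_clip(samples):
    """Infinite clipping: keep only the sign of each sample (+1.0 or -1.0)."""
    return [1.0 if x >= 0.0 else -1.0 for x in samples]


def band_signal(amplitude, freq_hz, rate_hz, n):
    """A sampled sinusoid standing in for the output of one channel filter."""
    return [amplitude * math.sin(2.0 * math.pi * freq_hz * i / rate_hz)
            for i in range(n)]


def mean_power(samples):
    """Average of the squared sample values."""
    return sum(x * x for x in samples) / len(samples)


# Two inputs that differ only in level (hypothetical amplitudes).
weak = band_signal(0.1, 400.0, 10000.0, 200)
strong = band_signal(5.0, 400.0, 10000.0, 200)

if __name__ == "__main__":
    print(infinite_clip(weak) == infinite_clip(strong))
    print(mean_power(infinite_clip(weak)))
```

Because only the zero crossings survive, both inputs clip to the same waveform with the same power. The second filter of the channel pair then band-limits that fixed-level waveform, which this sketch does not model.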

1.2.2  Pulsing  of  Resonators 

As noted above, a second consideration in the area of
source-system interaction is that of pulsed excitation of resonances.
There is interaction in the sense that the amplitude of the resonator
output can be optimized by proper selection of the pulse rate so
that harmonics fall at the maximum of the resonance tuning curve.
This phenomenon may be observed not only in the frequency domain
but by calculations based on rotating vectors. We present these
methods in Section 4.
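
A minimal sketch of the rotating-vector view follows. It assumes a single complex resonator mode with impulse response exp((-b + ja)t), where a = 2*pi*F, and it assumes b = pi*BW relates the decay rate to the bandwidth in Hz. These conventions, and the function name, are assumptions of this sketch rather than the formulation of Section 4.

```python
import cmath
import math


def steady_state_gain(f_res_hz, bw_hz, pulse_rate_hz):
    """Magnitude of the summed response just after a pulse, in steady state.

    Each earlier pulse contributes a vector that has rotated by a*T and
    decayed by exp(-b*T) per period T, so the sum is a geometric series:
    sum over n >= 0 of exp((-b + ja) n T) = 1 / (1 - exp((-b + ja) T)).
    """
    a = 2.0 * math.pi * f_res_hz   # resonant frequency in rad/s
    b = math.pi * bw_hz            # decay rate in rad/s (assumed convention)
    period = 1.0 / pulse_rate_hz
    return abs(1.0 / (1.0 - cmath.exp(complex(-b, a) * period)))


if __name__ == "__main__":
    # 500 Hz, 50 Hz bandwidth resonator: 100 pps puts the 5th harmonic on the
    # peak; 500/4.5 pps puts the peak midway between two harmonics.
    print(steady_state_gain(500.0, 50.0, 100.0))
    print(steady_state_gain(500.0, 50.0, 500.0 / 4.5))
```

When a harmonic lies on the resonance the vectors add in phase and the gain is largest; when the resonance lies midway between harmonics successive vectors oppose each other and the gain drops.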

An interesting extension of the above theory and observations
gives a possible explanation of alternate period jitter in pitch
periods. This phenomenon of alternately long and short pitch
periods has been observed by Lieberman (1961) to occur in about
40% of vocalizations and has also been noted by Smith (1968) in
selected data. As is explained in Section 4, the very occurrence
of alternate period jitter doubles the number of spectral
components, thus increasing the chance that a component will fall on or
near the peak of the resonance tuning curve. The amount of the
jitter can then be used to accentuate the specific component nearest
the peak. That this is the controlling factor in actual pitch jitter
is a matter of hypothesis. The theory, however, leads to formulas
for the calculation of jitter as a function of pitch and formant
frequency and thus provides a basis for subsequent verification.
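
The doubling of spectral components can be illustrated with a simple model: unit impulses whose periods alternate between T(1 + j) and T(1 - j), so that the pattern repeats every 2T and components may appear at multiples of half the nominal pitch frequency. The impulse model and the name `jitter_fraction` are assumptions of this sketch, which is not the derivation given in Section 4.

```python
import cmath
import math


def component_amplitude(k, jitter_fraction):
    """Unnormalized magnitude of the component at k times half the pitch rate.

    One 2T cycle holds two unit impulses, at t = 0 and t = T * (1 + jitter),
    so the k-th Fourier coefficient is 1 + exp(-j * pi * k * (1 + jitter)).
    """
    phase = -math.pi * k * (1.0 + jitter_fraction)
    return abs(1.0 + cmath.exp(1j * phase))


if __name__ == "__main__":
    for k in (1, 2, 3, 4):
        print(k, component_amplitude(k, 0.0), component_amplitude(k, 0.1))
```

With no jitter the odd-numbered components vanish and only the ordinary pitch harmonics (even k) remain. Any nonzero jitter brings the odd components in, and their size depends on the amount of jitter.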

The topic of vocal energy optimization bears some discussion.
The suggestion that this may actually occur implies the existence,
as part of the human speech production system, of a measurement
and control mechanism for sensing and improving vocal efficiency.
While this may seem improbable on a neurological basis, it could
occur on a physical basis. Physical systems tend to operate in
modes which minimize certain types of energy. As a coupled physical
system, the larynx and the vocal tract could function in this
manner. On the other hand, the maxima of vocal tract transmission are
also maxima of vocal tract impedance. The result is a tendency of
the vocal tract to resist being driven at rates producing components
falling on the resonances (Crystal, 1966). Simple modification of
the jitter formulas can lead to determination of amounts of jitter
which reduce a component which would otherwise occur at a resonance.


A  third  facet  of  the  program  described  by  this  final  report 
was  the  intended  computer  simulation  of  a  model  of  the  Vocal 
Response  Synthesizer  (VRS)  vocoder  synthesizer.  This  facet  of  the 
program  was  discontinued  when  it  appeared  more  advantageous  to 
devote program resources to the other areas.



II.  INVERSE  FILTERING  WITH  VOTIF 


2.1  Background 

The concept of inverse filtering is a natural consequence
of the acoustic theory of speech production (Fant, 1960). The theory
of production describes the vocal tract as a mechanism for
performing linear, minimum phase, acoustic filtering of the air flow
through the glottis. The filter is characterized by having an
infinite number of poles or resonances located, on the average,
at the odd harmonics of 500 Hz. In general, during vocalization,
only the first three or four of these resonances are excited,
with an extra pole and stable zero (anti-resonance) entering
into the filter during the production of nasal sounds. A natural
consequence of this theory is that each significant pole may be
cancelled with a zero (or anti-resonance or null) of the same frequency
and bandwidth. Likewise, the zero may be cancelled by a pole.
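
Pole-zero cancellation can be checked numerically with a small sketch. It models the resonance as a two-pole section with unity gain at DC and the null as its reciprocal, using a = 2*pi*F and b = pi*BW. These forms and the function names are assumptions made for illustration and are not the VOTIF circuit equations of Appendix A.

```python
import math


def resonance(s, f_hz, bw_hz):
    """Two-pole resonance with unity gain at DC (assumed model)."""
    a = 2.0 * math.pi * f_hz
    b = math.pi * bw_hz
    return (a * a + b * b) / ((s + b) ** 2 + a * a)


def null(s, f_hz, bw_hz):
    """Two-zero anti-resonance: the reciprocal of the resonance model."""
    a = 2.0 * math.pi * f_hz
    b = math.pi * bw_hz
    return ((s + b) ** 2 + a * a) / (a * a + b * b)


def cascade_db(freq_hz, res_setting, null_setting):
    """Gain in dB of resonance followed by null, evaluated at freq_hz."""
    s = 2j * math.pi * freq_hz
    g = resonance(s, *res_setting) * null(s, *null_setting)
    return 20.0 * math.log10(abs(g))


if __name__ == "__main__":
    # Hypothetical settings as (frequency in Hz, bandwidth in Hz).
    print(cascade_db(1000.0, (500.0, 60.0), (500.0, 60.0)))  # matched null
    print(cascade_db(500.0, (500.0, 60.0), (550.0, 60.0)))   # null tuned 10% high
```

With matched settings the cascade is flat (0 dB to within rounding) at every frequency. A null tuned 10% high leaves several dB of uncancelled resonance at 500 Hz.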

One verification and application of the acoustic theory of speech
production is the successful construction and use of inverse
filters by other researchers [Mathews et al. (1961), Holmes (1962)
and Lindqvist (1964 and 1965)].

VOTIF  was  built  to  provide  the  Digital  Speech  Branch  of 
AFCRL  with  the  equipment  to  study  vocal  source  characteristics 
for  their  possible  effect  on  vocoded  speech  quality.  In  building 
this  equipment  we  sought  to  utilize  the  latest  in  solid  state 
technology,  be  able  to  handle  wide-band  speech,  permit  the  use 
of  direct-reading  linear  controls  and  give  ease  of  calibration. 

The  specific  design  considerations,  circuitry  and  operating 
instructions  for  the  filters  appear  as  Appendix  A  to  this  report. 

2.2  Design  Considerations 
2.2.1  Performance  Specifications 

The target specifications, which were often exceeded in the
instrument itself, were derived from considerations of both the human
speech production and hearing mechanisms as previously characterized
by other researchers.


1.  Tuning Range

Tuning range is presented in Figure 1.1. The lower
bound on the frequency is one cited by Flanagan
(1965) as a design criterion for formant vocoders
and is a little over half the lowest formant
frequency (~190 Hz) measured by Peterson and Barney
(1952). The upper limit of the tuning range will
permit matches to most fourth formants and provide
for a sharp glottal pulse.

2.  Precision and Accuracy

The criterion for choosing the precision is that the
adjusted values of frequency and bandwidth must
approach the target values closely enough so that the
ripple remaining from incomplete cancellation will
not seriously distort the waveform of the glottal
pulse. In this case the ripple was evaluated by
looking at the area under the maximum lobe of the
ripple and requiring that this area not exceed
2.5% of the area of the desired response. This ripple
is obtained by first finding the Laplace transform
of the combined transmission of resonance and null

$$G(s) = H(s) \cdot P(s) = \frac{(s+b+\epsilon)^2 + (a+\delta)^2}{(s+b)^2 + a^2} = 1 + \frac{2\epsilon(s+b) + 2\delta a}{(s+b)^2 + a^2} + \frac{\epsilon^2 + \delta^2}{(s+b)^2 + a^2}$$

where

$\epsilon$ = error in adjusting bandwidth (radian)

$\delta$ = error in adjusting frequency (radian)

For small $\epsilon$ and $\delta$ the last term is negligible and we
have for the impulse response

$$g(t) = u_0(t) + e^{-bt} \{ 2\epsilon \cos at + 2\delta \sin at \}$$

Looking at just one lobe of the sinewave, we see
that the area under it is $4\delta/a$. For
$\epsilon_{\max} = \delta_{\max}$ the maximum area under a lobe is
$5.7\,\delta_{\max}/a$. Allowing a maximum one-lobe area of .025
(the impulse has unit area), we get the relationship

$$\frac{5.7\,\delta_{\max}}{a} \le .025$$

or

$$\epsilon_{\max} = \delta_{\max} \approx .005\,a$$

From this it can be seen that the required frequency
precision is 1/2% of measured value.

Accuracy requirements reflect how closely we wish
to know the true parameters for the resonance.
Suitable criteria appear to be the DL's for formant
frequencies and bandwidths as reported by
Flanagan (1965, pp. 212-213) in discussing his own
(Flanagan, 1955) and Stevens' (1952) experiments.
Frequency DL's of 3 to 5 percent and bandwidth
DL's of 20 to 40 percent are just discriminable.

3.  Operating Range and Characteristics

The maximum frequency of 10 kHz was chosen so that
there would be ample resolution for extracting timing
information from the glottal signal. Lieberman
(1961) has noted interesting laryngeal behaviour
which produces timing shifts in the glottal pulse
of the order of tens of milliseconds.

The lower bound is chosen such that there will be a
stable base-line over several pitch periods yet the
complexities of going to DC operation will be avoided.

The delay criterion was chosen so as to preserve
timing information as discussed above.

The amplitude criterion was chosen so that observed
amplitudes in unsuppressed components, such as the
one due to larynx-vocal tract interaction, will be
accurate to approximately 3%.

4.  Gain

In modeling the vocal tract as an acoustic system,
one notes that its transmission at DC is unity.
Thus, its inverse should also have the capability
of being adjusted to unity transmission at DC.

5.  Signal-to-Noise Ratio

Chosen to match performance characteristics of
other audio equipment and be reasonable in terms of
the technology utilized.
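
The arithmetic behind the precision criterion and the ±0.25 dB amplitude criterion can be checked with a short sketch. It neglects the exponential decay over one lobe, as the text does, and the function names are hypothetical. The exact coefficient for equal errors is 4*sqrt(2), about 5.66, which the text rounds to 5.7.

```python
import math


def lobe_area(eps, delta, a):
    """Area under one lobe of 2*eps*cos(a*t) + 2*delta*sin(a*t), decay neglected.

    The sum is a sinusoid of amplitude 2*sqrt(eps**2 + delta**2), and one
    half-cycle of A*sin(a*t) encloses an area of 2*A/a.
    """
    amplitude = 2.0 * math.hypot(eps, delta)
    return 2.0 * amplitude / a


def max_relative_error(area_limit=0.025):
    """Largest eps/a (= delta/a) that keeps the lobe area within area_limit."""
    return area_limit / (4.0 * math.sqrt(2.0))


def db_to_percent(db):
    """Amplitude change, in percent, corresponding to a gain error in dB."""
    return (10.0 ** (db / 20.0) - 1.0) * 100.0


if __name__ == "__main__":
    print(lobe_area(0.0, 1.0, 1.0))    # frequency error only
    print(lobe_area(1.0, 1.0, 1.0))    # equal bandwidth and frequency errors
    print(max_relative_error() * 100)  # allowed error as a percent of a
    print(db_to_percent(0.25))         # amplitude error for 0.25 dB
```

The allowed error comes out a little under 1/2% of the radian frequency, consistent with the stated precision, and 0.25 dB corresponds to an amplitude error of roughly 3%.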



2.2.2  Other  Design  Considerations 


An important consideration in the design of VOTIF was the
use of resistive controls. In the present circuitry this gives
the precision and accuracy of adjustment desired and allows for
adjustment and calibration by appropriate resistive trimming.
The use of resistors also has implications for extending the
capability of VOTIF. One extension is to provide for automatic
recording of the frequency and bandwidth settings. This can
be achieved either by momentary switching of an adjustment
resistor from the filtering circuit to a measuring circuit or by
adding a third gang to each pot for continuous connection to the
measuring circuit. For automatic adjustment of the filtering
circuits, the potentiometers could be replaced by digital
attenuators. These attenuators are merely D-to-A converters in which
the constant reference source has been replaced by the signal to
be attenuated.
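
The digital attenuator idea can be sketched as a multiplying converter: the digital code sets the fraction of the input signal that reaches the output. The 10-bit word length and the function name below are hypothetical choices for illustration; no such unit was built under this program.

```python
def digital_attenuator(sample, code, bits=10):
    """Model of a D-to-A converter whose reference input carries the signal.

    The output is the input scaled by code / 2**bits, so the code plays the
    role of the potentiometer setting.
    """
    full_scale = 2 ** bits
    if not 0 <= code < full_scale:
        raise ValueError("code out of range for the given word length")
    return sample * code / full_scale


if __name__ == "__main__":
    # Half-scale code passes half of the input amplitude.
    print(digital_attenuator(1.0, 512))
```

Because the code is already a number, recording a setting or adjusting it automatically requires no extra measuring circuit.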

A design objective which was rejected after careful
consideration was the implementation of units that could be switched
between null and resonance behavior. Considered for
implementation was the use of one type of circuit either directly or in a
feedback loop, to get its inverse. The strict constraints on
phase over the wide bandwidth of the instrumentation ruled out
this approach. Hence, two separate types of units were designed
and built.

2.3  Use  of  VOTIF 

2.3.1  Planned  Use  on  Speech 

The use of VOTIF on natural speech requires the
implementation of a distortion-free means for repeating short segments of
the signal to be analyzed. The segments should be several pitch
periods in length so that any initial transients may die out.
However, the segments should be short enough so that the repetition
rate is adequate. An adequate repetition rate will permit close
coordination of filter adjustment and observation of the effect
of the adjustment. One would also like to avoid flicker but
this is not generally obtainable with low pitch signals.
Besides reproducing the speech signal, the repetition
instrumentation should provide signals for jitter-free triggering of the
display. Two means for implementing the desired signal
reproducing instrumentation are discussed in the following. Neither
was implemented or tested as part of the work performed. Rather,
VOTIF was tested with synthetic signals.

Previous applications of the inverse filters have utilized
FM tape reproducers for repetitive presentation of the signal
to be analyzed (Lindqvist, 1964). FM is used where AM cannot
be, because the FM techniques preserve waveform whereas AM
techniques introduce appreciable phase distortion in order to preserve
relatively flat amplitude vs frequency characteristics. Tape
recording techniques do however possess the drawback that the
mechanical design requires tape loops of lengths which keep the
repetition rate low. In addition, there would be problems
indexing through long signals so as to give an analysis of many
consecutive periods of a long vocalization. There also is a
question of the stability of the recording tape and the reproduced
signal from period to period.

An alternative approach is to use a digitally stored
representation of the signal to be analyzed. Repetitive D-to-A
conversion is performed to obtain the analog signal for analysis.
When the digital signal has been obtained directly or from an
FM recording, the requirement for a phase-distortion-free signal
is met. Long utterances recorded on digital tape or disk may
be easily indexed to provide continuous analysis and the actual
segment length repeated can be chosen to optimize the analysis.
At a 10 kHz sampling rate, only 1000 storage locations are
needed to provide a tenth of a second segment, which would
provide at least two full pitch periods of a pitch having as
low a frequency as 50 Hz. With the present general availability
of digital hardware, this approach is highly advisable.
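
The storage estimate above is simple arithmetic, sketched here for convenience. The function names are hypothetical.

```python
def samples_needed(rate_hz, segment_s):
    """Number of storage locations for a segment at the given sampling rate."""
    return round(rate_hz * segment_s)


def pitch_periods(segment_s, pitch_hz):
    """Number of pitch periods contained in the segment."""
    return segment_s * pitch_hz


if __name__ == "__main__":
    print(samples_needed(10000, 0.1))  # 10 kHz sampling, 0.1 s segment
    print(pitch_periods(0.1, 50))      # lowest pitch considered, 50 Hz
```

A tenth of a second at 10 kHz is 1000 samples and spans five periods of a 50 Hz pitch, comfortably more than the two full periods asked for.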


2.3.2  Use  of  VOTIF  on  Synthetic  Signals 


To demonstrate the use of VOTIF in processing signals, two
types of experiments were run. In the first, the cascade of a
VOTIF resonance and a VOTIF null was excited by a pulse generator,
to demonstrate the inverse characteristics of these two types
of networks. In the second experiment, a synthetic two-formant
vowel was analyzed.

The  results  of  the  experiment  with  the  paired  VOTIF  resonance 
and  VOTIF  anti-resonance  are  illustrated  in  the  three  photographs 
of  Figure  2.1.  These  photographs  show  the  VOTIF  input  and  output 
for  three  different  conditions.  In  all  pictures  the  bottom 
oscilloscope  trace  is  the  pulse  generator  input  signal  to  the 
system;  the  top,  the  processed  signal.  The  pulses  come  from  a 
General  Radio  Model  1340  generator  and  are  1  msec  wide  and  occur 
at  a  rate  of  100  pps. 

In the top photograph (Fig. 2.1a) only the resonance unit
is in the circuit. It is set for a frequency of 3600 Hz and a
bandwidth of 665 Hz. In Fig. 2.1b, the null has been switched
into the cascade following the resonance. The null is set to
F = 3350 Hz and BW = 675 Hz, giving only partial cancellation due to
the frequency mistuning of 7%.

In Fig. 2.1c, the resonance has been totally cancelled with
the null set to F = 3650 Hz and BW = 675 Hz. The null settings differ
from the resonance settings by about 1.5% in both frequency and
bandwidth. This is well within the design specifications. There
is slight overshoot at the edges of the pulse due to incomplete
cancellation for the very large derivatives occurring at these
edges. The system noise tends to widen the oscilloscope trace.
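
The quoted mistuning figures follow directly from the dial settings, as the short check below shows. The helper name is hypothetical, and the differences are taken relative to the resonance settings.

```python
def percent_difference(setting, reference):
    """Absolute difference between two settings, as a percent of the reference."""
    return abs(setting - reference) / reference * 100.0


if __name__ == "__main__":
    print(percent_difference(3350.0, 3600.0))  # Fig. 2.1b frequency mistuning
    print(percent_difference(3650.0, 3600.0))  # Fig. 2.1c frequency difference
    print(percent_difference(675.0, 665.0))    # bandwidth difference
```

The 3350 Hz null is about 7% below the resonance. The 3650 Hz null is about 1.4% above it, with the bandwidth settings about 1.5% apart.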

In the second experiment, a two-formant synthetic vowel
sound was analyzed using null units only. The signal was generated
by a Bell System Science Experiment No. 3 speech synthesizer and
the above-referenced pulse generator.

[Figure 2.1  Cancellation of VOTIF Resonance by VOTIF Null with
1 msec, 100 pps pulse input. Bottom trace of all pictures shows pulses.
  a) Uncancelled resonance of F = 3600 Hz, BW = 665 Hz
  b) Partially cancelled resonance with null of F = 3350 Hz, BW = 675 Hz
  c) Cancelled resonance with null of F = 3650 Hz, BW = 675 Hz]


The synthesizer utilizes RLC tuned circuits to simulate the
formants. An external pulse generator was used for the periodic source.
Low pass filters were used at both the input and at the output of
the cascaded nulls, to help reduce noise. The results of the
experiment are illustrated in the three photographs of Figure 2.2.
In all pictures, the bottom trace is the unprocessed signal. The
repetition rate is 100 pps.

In Fig. 2.2a we show the effect of removing the first formant
at F = 695 Hz and BW = 150 Hz. What remains is the damped exponential
for the second formant. In Fig. 2.2b, we show the effect of
cancelling the second formant at F = 1440 Hz and BW = 740 Hz. What
remains in this case is the first formant. The similarity of this
remaining signal to the unprocessed signal indicates the weakness of
the second formant produced by the synthesizer.

Figure 2.2c illustrates the cancellation of both formants.
The resulting pulse represents the original source pulse as
modified by the amplifiers and low-pass filters. Unlike natural speech,
the synthetic source is a sharp-edge pulse of short duration and
when rederived by inverse filtering exhibits spike type overshoot
as discussed in the previous experiment. The noise in the inverse
filtered signal is high frequency synthesizer and amplifier noise,
amplified by the rising gain-frequency characteristics of the nulls.
The noise may appear to be sinusoidal because of a transfer
function peak around 18 kHz caused by the intersection of the rising
null gain with the 18 kHz low-pass filter in the null output stage.
It should be noted that in adjusting the filter units it is
important not to overload the internal circuits of each filter. The
test points described in the appendix are particularly useful for
monitoring for overload.




a) Signal after cancellation of first formant, F = 695 Hz, BW = 150 Hz
b) Signal after cancellation of second formant, F = 1440 Hz, BW = 750 Hz
c) Signal after cancellation of both formants

Fig. 2.2  VOTIF analysis of two-formant synthetic speech. Bottom
trace of all pictures shows synthetic vowel.



III.  SOURCE-SYSTEM INTERACTION IN THE CHANNEL VOCODER

Source-system interaction in the channel vocoder is the effect
of the repetition rate of the source on the output of the channel
filters. As there are channel filters in both the analyzer and the
synthesizer portions of the vocoder, it may occur in both. In the
analyzer the interaction is that between the human vocal source
and the analyzing filters. In the synthesizer, it is that between
the synthesizer buzz source and the synthesis filters.

If this interaction were to take place in both the analyzer
and the synthesizer it would distort the spectrum of the synthesized
speech. It must, therefore, be compensated during either analysis
or synthesis. In presently used vocoder techniques, it is compensated
in the synthesizer by spectrum flattening. This means that the
channel  signals  transmitted  from  analyzer  to  synthesizer  carry  some 
unnecessary  information  about  the  pitch  rate.  To  give  an  indica¬ 
tion  of  the  amount  of  the  source-system  interaction  component  in 
the  channel  signals  and  the  needed  amount  of  correction  at  the 
synthesizer,  the  following  section  presents  a  calculation  of  this 
component.  The  section  after  next  discusses  the  effect  of  spectrum 
flattening  on  the  resulting  synthesized  signal  in  terms  of  both  the 
degree  of  normalization  of  power  and  the  modification  of  signal 
peak  factor. 

3.1  The Effect of Pitch Rate on Channel Filter Output

The effect of pitch rate on channel filter output is a function
of the number of components passing through a particular filter and
the expected number. For a pulse rate of $\omega_0$ radians/second
we would expect a filter of $\Omega$ radians/second bandwidth to pass
$\Omega/\omega_0$ components, which is not necessarily an integer.
However, the actual number of components passed must be an integer
and is given by

$$N = \left[\frac{\omega_u}{\omega_0}\right] - \left[\frac{\omega_l}{\omega_0}\right] \tag{3.1}$$




where $\omega_u$ and $\omega_l$ are the upper and lower limits of the
passband, respectively, and $[x]$ denotes the integer value of $x$.
The limits are related by

$$\Omega = \omega_u - \omega_l \tag{3.2}$$

From these formulas we see that N is bounded as follows:

$$\left| N - \frac{\Omega}{\omega_0} \right| < 1 \tag{3.3}$$

The absolute difference between N and $\Omega/\omega_0$ may actually
approach arbitrarily close to 1.

If we consider that the interaction is the ratio of the actual
signal power passed by the filter to the expected signal power and
that each component adds one unit of power we would get

$$I_{dB} = 10 \log_{10}\left[\frac{N\omega_0}{\Omega}\right] \tag{3.4}$$

From Eq. (3.3) we get a bound on I:

$$10 \log_{10}\left(\frac{N}{N+1}\right) \le I_{dB} \le 10 \log_{10}\left(\frac{N}{N-1}\right) \tag{3.5}$$

The upper bound does not exist for N = 1.
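
As a rough numerical illustration of Eqs. (3.1) and (3.4), the
following Python sketch counts the harmonics that fall in a band and
compares the count with the expected number. The function names and
the 300 to 450 Hz band are assumptions chosen for the example, not
values from this report; frequencies are given in Hz, which leaves
the ratios unchanged.

```python
import math

def components_passed(f_low, f_high, f0):
    """Eq. (3.1): number of harmonics of f0 lying in the band (f_low, f_high]."""
    return math.floor(f_high / f0) - math.floor(f_low / f0)

def interaction_db(f_low, f_high, f0):
    """Eq. (3.4): actual over expected power, one unit of power per component.

    Assumes at least one component falls in the band (N >= 1).
    """
    n = components_passed(f_low, f_high, f0)
    expected = (f_high - f_low) / f0      # Omega / omega_0, generally not an integer
    return 10 * math.log10(n / expected)

# Hypothetical 150 Hz wide channel driven at three pitch rates.
for f0 in (100.0, 110.0, 150.0):
    n = components_passed(300.0, 450.0, f0)
    print(f"f0 = {f0:5.1f} Hz  N = {n}  I = {interaction_db(300.0, 450.0, f0):+.2f} dB")
```

In this example a 10 Hz change in pitch moves the channel from one
component to two, and the interaction term changes sign accordingly.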

The possible range of I for various small values of N is given
in Table 3.1. The values of N represented in the table are typical
for the number of components that fall in the various channel bands
in vocoders.

The interaction for N from 1 to 3 is of the order of the quantum
step used in quantizing the vocoder analyzer output. This indicates
that different pitches could result in more than one pattern of
digits for a given articulation of a particular speaker. The
interaction may also be interpreted as the error which exists if
spectrum flattening or some similar form of compensation is not used
in a vocoder system. That this error is appreciable can be demonstrated



by  the  subjective  improvements  obtained  by  using  spectrum  flattening. 
Both  effects  of  this  type  of  interaction  are  increased  by  the 
dynamics  of  changing  pitch.  Thus  changes  of  the  order  of  the  ranges 
listed  in  the  table  below  would  occur  every  time  a  pitch  change 
caused  a  component  to  move  from  one  band  to  an  adjacent  one. 

Table 3.1

VARIATION OF FILTER OUTPUT INTENSITY
FROM EXPECTED VALUE AS A FUNCTION OF
THE NUMBER OF COMPONENTS PASSED

  N     I_min (dB)    I_max (dB)    Range (dB)

  1       -3.0           -             -
  2       -1.8           3.0           4.8
  3       -1.2           1.8           3.0
  4       -0.9           1.2           2.1
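
The tabulated limits can be checked against the bound of Eq. (3.5).
The short Python sketch below is only an illustration; the function
name is an assumption, and the upper bound is returned as None for
N = 1, where it does not exist.

```python
import math

def interaction_bounds_db(n):
    """Lower and upper limits of I in dB for n passed components, per Eq. (3.5)."""
    lower = 10 * math.log10(n / (n + 1))
    upper = 10 * math.log10(n / (n - 1)) if n > 1 else None  # no upper bound for N = 1
    return lower, upper

for n in range(1, 5):
    lower, upper = interaction_bounds_db(n)
    if upper is None:
        print(f"N = {n}: I_min = {lower:.2f} dB, no upper bound")
    else:
        print(f"N = {n}: I_min = {lower:.2f} dB, I_max = {upper:.2f} dB, "
              f"range = {upper - lower:.2f} dB")
```

Evaluated this way, the limits agree with the table to within about
0.1 dB; the lower limit for N = 4 comes out near -0.97 dB.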



3.2  The  Effect  of  Spectrum  Flattening  on  the  Synthesized  Signal 


Spectrum flattening as performed in channel vocoder synthesizers
is achieved by distorting the waveform of the signal. In analyzing
spectrum flattening, one should investigate the effect of the
flattening on the shape of the waveform as well as on the power of
the waveform. In the following, we examine peak factor (the ratio of
peak signal to signal power) as an indicator of the effect on the
waveform.

A model of a single channel of a vocoder is shown in Fig. 3.1.
The two bandpass filters are identical, with the result that the
same frequency components appear at both (A) and (C), but with their
strengths changed. Due to the action of the infinite clipper many
more components appear at (B). The power at (B) is one because the
signal there is always either +1 or -1. Because some of the
components contributing to this power do not pass through BPF2, the
power at (C) is actually lower than the target value of unity.

For a constant frequency impulse source and ideal bandpass
filters the signal at (A) is

$$s_A(t) = \frac{\sin(N\omega_0 t/2)}{\sin(\omega_0 t/2)} \cos(\omega_c t) \tag{3.6}$$

where N = number of components passed by the filter,

$\omega_0$ = radian pulsing frequency, i.e., the difference in
frequency between adjacent components, and

$\omega_c$ = the center frequency of the passed components.

When the number of components N is odd, $\omega_c$ is the frequency
of the center component. When N is even, $\omega_c$ is the average of
the two innermost components.

Because of the even symmetry, the peak signal occurs for t = 0
and has a value N, which is actually the sum of the N equal amplitude



FIG. 3-1  MODEL OF SINGLE CHANNEL OF SPECTRUM FLATTENING SYNTHESIZER
[Block diagram: source input, Bandpass Filter 1, infinite clipper,
Bandpass Filter 2, and a multiplier driven by the channel gain
control, giving the output for summation.]




components. The total power is N times the power in a single
component, this power being normalized to one. Thus the peak factor,
defined as

$$PF = 10 \log_{10}\left[\frac{(\text{peak signal})^2}{\text{signal power}}\right] \tag{3.7}$$

is $10 \log_{10} N$.


Because the signal symmetry is maintained during the clipping
operation and subsequent filtering, the peak signal value continues
to be the sum of the individual component amplitudes. The power is
the sum of the squares of these amplitudes, giving

$$PF = 10 \log_{10}\left[\frac{\left(\sum_n c_n\right)^2}{\sum_n c_n^2}\right] \tag{3.8}$$

where $c_n$ is an individual component amplitude.

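To obtain the value of the components we analyze
$\operatorname{sgn}[s_A(t)]$, with $s_A(t)$ as given by Eq. (3.6), at
each of the components. We define $\operatorname{sgn}\{\cdot\}$ as

$$\operatorname{sgn}\{x\} = \begin{cases} +1 & \text{for } x > 0 \\ 0 & \text{for } x = 0 \\ -1 & \text{for } x < 0 \end{cases} \tag{3.9}$$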


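The strength of the component is

$$c_{\pm n} = \frac{1}{2\pi}\int_{-\pi}^{\pi} \left\{\operatorname{sgn}\left[\frac{\sin(N\theta/2)}{\sin(\theta/2)}\right]\cos(\hat{n}\theta)\right\} \left\{\cos(p\theta)\,\operatorname{sgn}[\cos(p\theta)]\right\} d\theta \mp \frac{1}{2\pi}\int_{-\pi}^{\pi} \left\{\operatorname{sgn}\left[\frac{\sin(N\theta/2)}{\sin(\theta/2)}\right]\sin(\hat{n}\theta)\right\} \left\{\sin(p\theta)\,\operatorname{sgn}[\cos(p\theta)]\right\} d\theta \tag{3.10}$$

where

$$\theta = \omega_0 t, \qquad p = \frac{\omega_c}{\omega_0} \ge 1, \qquad \hat{n} = \begin{cases} n & \text{for N odd} \\ \dfrac{1+2n}{2} & \text{for N even} \end{cases}$$

The magnitude of the index of c indicates the distance of the
component being evaluated from $\omega_c$, the center frequency of the
components. The sign indicates whether the component is lower or
higher than the center.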

For $p \gg N$, we can replace the second terms in each integral
of Eq. (3.10) by their averages, which are $2/\pi$ and 0 for the first
and second integrals, respectively. Thus, for large p, the strengths
of components equidistant from $\omega_c$ would be equal. This is to
be expected because letting $p \gg N$ is equivalent to saying that the
center frequency of the passband is much higher than its bandwidth
and nonlinear distortion does not cause interaction between
symmetrical components.

The evaluation of the integrals of (3.10) is accomplished by
piece-wise summing integrals over those portions of the argument
where the $\operatorname{sgn}\{\cdot\}$ functions in the integral do
not change sign. Because of symmetry, it is necessary to integrate
only from 0 to $\pi$. This allows the reduction

$$\operatorname{sgn}\left[\frac{\sin(N\theta/2)}{\sin(\theta/2)}\right] = \operatorname{sgn}\left[\sin\left(\frac{N\theta}{2}\right)\right] \tag{3.11}$$


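Thus Eq. (3.10) reduces to

$$c_{\pm n} = \frac{1}{\pi}\sum_i \mathrm{SGN}(\theta_i^+)\int_{\theta_i}^{\theta_{i+1}} \cos(\hat{n}\theta)\cos(p\theta)\,d\theta \mp \frac{1}{\pi}\sum_i \mathrm{SGN}(\theta_i^+)\int_{\theta_i}^{\theta_{i+1}} \sin(\hat{n}\theta)\sin(p\theta)\,d\theta \tag{3.12}$$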


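where

$$\mathrm{SGN}(\theta) = \operatorname{sgn}\left[\sin\left(\frac{N\theta}{2}\right)\right] \operatorname{sgn}[\cos(p\theta)]$$

and where the $\theta_i$'s define points of change of SGN($\theta$)
for $0 < \theta < \pi$. The integrals in Eq. (3.12) have the values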


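$$\int \cos(\hat{n}\theta)\cos(p\theta)\,d\theta = \begin{cases} \dfrac{\sin[(p-\hat{n})\theta]}{2(p-\hat{n})} + \dfrac{\sin[(p+\hat{n})\theta]}{2(p+\hat{n})} & p \ne \hat{n} \\[2ex] \dfrac{\theta}{2} + \dfrac{\sin(2p\theta)}{4p} & p = \hat{n} \end{cases} \tag{3.13a}$$

$$\int \sin(\hat{n}\theta)\sin(p\theta)\,d\theta = \begin{cases} \dfrac{\sin[(p-\hat{n})\theta]}{2(p-\hat{n})} - \dfrac{\sin[(p+\hat{n})\theta]}{2(p+\hat{n})} & p \ne \hat{n} \\[2ex] \dfrac{\theta}{2} - \dfrac{\sin(2p\theta)}{4p} & p = \hat{n} \end{cases} \tag{3.13b}$$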


Thus the evaluation of the component strengths can be reduced to a
summation which can be performed on a computer. The computer can
also be programmed to determine the $\theta_i$'s.

We now consider calculation of the limiting case of $p \to \infty$
to derive formulas which not only give us additional feeling for the
mathematics but also provide a means for checking calculations
performed according to the above equations. As above, we integrate
from 0 to $\pi$ and reduce the second term of the integral to the
constant $2/\pi$. This gives

$$c_n = \frac{2}{\pi^2}\int_0^{\pi} \operatorname{sgn}\left[\sin\left(\frac{N\theta}{2}\right)\right] \cos(\hat{n}\theta)\,d\theta \tag{3.14}$$

This is further reduced to a summation by the piece-wise integration
methods described above. This gives



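$$c_n = \begin{cases} \dfrac{2}{\pi} & \text{for } N = 1 \\[2ex] \dfrac{2}{\pi^2}\left\{ \displaystyle\sum_{k=0}^{[N/2]-1} (-1)^k \int_{2k\pi/N}^{2(k+1)\pi/N} \cos(\hat{n}\theta)\,d\theta + (-1)^{[N/2]} \int_{2[N/2]\pi/N}^{\pi} \cos(\hat{n}\theta)\,d\theta \right\} & \text{for } N \ge 2 \end{cases} \tag{3.15}$$

where [x] = integer value of x.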


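The integrals may be reduced using

$$\int_{2k\pi/N}^{2(k+1)\pi/N} \cos(\hat{n}\theta)\,d\theta = \begin{cases} \dfrac{2\pi}{N} & \text{for } \hat{n} = 0 \\[2ex] \dfrac{1}{\hat{n}}\left\{ \sin\left[\dfrac{2(k+1)\pi\hat{n}}{N}\right] - \sin\left[\dfrac{2k\pi\hat{n}}{N}\right] \right\} & \text{for } \hat{n} \ne 0 \end{cases} \tag{3.16}$$

which leads to an easily implemented computational procedure.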
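
The limiting-case procedure can be sketched in a few lines of Python.
This is an illustrative sketch only, not the program used for the
report: it assumes the large-p form of the component strength (a
factor of 2/pi^2 times the integral of sgn[sin(N theta/2)] cos(n theta)
from 0 to pi), takes the component offsets to be integers for N odd
and half-odd integers for N even, and forms the peak factor as the
squared sum of the strengths over the sum of their squares, which does
not depend on how the strengths are normalized. All function names are
hypothetical.

```python
import math

def cos_integral(a, b, n_hat):
    """Integral of cos(n_hat * theta) from a to b, in closed form."""
    if n_hat == 0:
        return b - a
    return (math.sin(n_hat * b) - math.sin(n_hat * a)) / n_hat

def component_strength(N, n_hat):
    """Large-p component strength by piece-wise integration over 0..pi.

    sgn[sin(N*theta/2)] changes sign at theta = 2*k*pi/N, so each
    interval of width 2*pi/N contributes with sign (-1)**k.
    """
    full = N // 2                      # whole sign intervals inside 0..pi
    total = 0.0
    for k in range(full):
        total += (-1) ** k * cos_integral(2 * k * math.pi / N,
                                          2 * (k + 1) * math.pi / N, n_hat)
    # remaining partial interval (zero width when N is even)
    total += (-1) ** full * cos_integral(2 * full * math.pi / N, math.pi, n_hat)
    return 2.0 / math.pi ** 2 * total

def offsets(N):
    """Distances of the N passed components from the center, in units of omega_0."""
    half = (N - 1) / 2
    return [i - half for i in range(N)]

def peak_factor_db(N):
    """Peak factor after clipping and refiltering, in the limit of large p."""
    c = [component_strength(N, abs(o)) for o in offsets(N)]
    return 10 * math.log10(sum(c) ** 2 / sum(x * x for x in c))

for N in (1, 2, 3):
    print(N, round(peak_factor_db(N), 2), round(10 * math.log10(N), 2))
```

For N = 3 this sketch gives a peak factor of about 4.6 dB, against
4.8 dB for the unclipped signal.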


The results of the computations outlined above are shown in
Figs. 3.2 and 3.3. Fig. 3.2 shows the power in the components after
spectrum flattening, for various values of p. We can see that the
spectrum flattening achieves its objective to within 2.5 dB. As
noted above, the power output is less than 0 dB because the bandpass
filter after the clipper removes some of the components which
contributed to the 0 dB power level at the output of the clipper.


In  Fig.  (3.3),  the  peak  factor  of  the  channel  output  signals 
is  shown.  In  this  case  the  computed  peak  factor  is  within  2  dB  of 
the  peak  factor  obtained  without  clipping.  The  conclusion  to  be 
drawn  is  that  spectrum  flattening,  as  modeled  above,  is  an  effective 
way  of  dealing  with  source-system  interaction  in  channel  vocoders. 
This  acceptance  is  conditioned  on  there  being  no  source-system 
interaction  distortion  in  the  encoding  process,  as  discussed  above. 




FIG. 3-2  EFFECT OF SPECTRUM FLATTENING ON CHANNEL POWER.
p is the ratio of center frequency to fundamental frequency.
(Horizontal axis: number of components.)

FIG. 3-3  EFFECT OF SPECTRUM FLATTENING ON PEAK FACTOR.
p is the ratio of center frequency to fundamental frequency.


IV.  PULSING OF RESONATORS


Our interest in the periodic or quasi-periodic impulsing of a
harmonic oscillator or resonator derives from its similarity to the
vowel production process. For most vowels, the first formant
dominates the generated signal. Hence, we may hope to obtain
interesting results from the study of a single oscillator. In actual
speech production the oscillation appears to derive its excitation
from a single discontinuity in the glottal pulse. This discontinuity
can be replaced by an impulse if the resulting amplitude is scaled by
the appropriate power of the frequency of oscillation and the phase
is shifted by a multiple of $\pi/2$ radians. The power to which the
frequency is raised and the multiplier of the phase shift are both
equal to the order of the discontinuity. In the discussion which
follows, this compensation of amplitude and phase is unimportant.

In  what  follows,  we  examine  how  the  amplitude  of  the  oscilla¬ 
tion  varies  as  a  function  of  the  relationship  between  the  pulse 
rate  and  the  oscillator  frequency.  In  a  second  section,  we  ex¬ 
plain  how  appropriate  alternation  of  short  and  long  inter-pulse 
periods  may  moderate  maxima  or  minima  of  resonator  response. 

4.1  Periodic  Pulsing  of  a  Resonator 

As was described in a paper by House (1959), changing the pulse
rate while holding the resonance characteristics constant produces
fluctuations in the amplitude of the signal transmitted through the
resonance. This can be explained by Fig. 4.1, in which we show how
the transmission function of a resonance affects the amplitude of
the components of impulse trains of two different frequencies, shown
by solid and dashed lines respectively. The pulse rate represented
by the dashed line will produce a larger output than the other
because a component falls at the peak of the transmission.

Another  way  of  examining  this  phenomenon  is  in  terms  of  the 
complex  representation  for  the  behavior  of  the  harmonic  oscillator 
between  pulses: 


FIG. 4-1  TRANSMISSION OF COMPONENTS BY A RESONANCE
(Vertical axis: relative amplitude.)

FIG. 4-2  HARMONIC OSCILLATOR BEHAVIOR
(Axes: Re[$\xi(t)$] and Im[$\xi(t)$].)


$$\xi(t) = A e^{(\sigma + j\omega)t} \tag{4.1}$$

where A is a complex amplitude. The real signal which would actually
be obtained from a resonator is the real part of this complex
signal. If we pulse the oscillator every T seconds with a real
valued pulse of amplitude p, the steady-state oscillator behavior
may be described by the equation

$$A = A e^{\sigma T + j\omega T} + p \tag{4.2}$$


This equation indicates that the ringing of the oscillator starts
at a value A and rings for T seconds until it achieves a value
$A \exp[\sigma T + j\omega T]$. At such time a pulse p is used to
re-obtain the initial oscillator amplitude and a new period of decay
begins.


Equation (4.2) is illustrated geometrically in Fig. 4.2. The
spiral shows the locus of $\xi(t)$ over the interval T. The real
signal is the real axis projection of the vector whose tip follows
the spiral. The rotation is the angular change of the sinusoid while
the decreasing diameter of the spiral is the exponential decay of
the amplitude of the sinusoid. The angle $\theta$ between the two
vectors is the total rotational angle modulo $2\pi$.


The response, or ratio of the oscillator amplitude to the pulse
amplitude, may be obtained from Eq. (4.2):

$$\frac{A}{p} = \frac{1}{1 - e^{\sigma T + j\omega T}} \tag{4.3}$$

From this we may obtain the squared magnitude of the response:

$$\left|\frac{A}{p}\right|^2 = \frac{1}{1 - 2e^{\sigma T}\cos\omega T + e^{2\sigma T}} \tag{4.4}$$

Note that this equation could also be obtained by applying
trigonometry to the vector diagram in Fig. 4.2.




From Eq. (4.4) it can be seen that the magnitude of the response
oscillates between maxima and minima as $\omega T$ changes through
successive multiples of $\pi$. We have minima

$$\left|\frac{A}{p}\right|^2 = \frac{1}{(1 + e^{\sigma T})^2} \tag{4.5}$$

for $\omega T = (2n+1)\pi$, and maxima

$$\left|\frac{A}{p}\right|^2 = \frac{1}{(1 - e^{\sigma T})^2} \tag{4.6}$$

for $\omega T = 2n\pi$.

This alternating maximization and minimization of the response is
the same as that predicted by our previous discussion of frequency
components and calculated in detail by House (1959).


A set of curves depicting Eq. (4.4) is given in Fig. 4.3. In
labeling these curves we have used the relationships

$$2\pi F T = \omega T$$

$$\pi \, BW \cdot T = -\sigma T$$

where F and BW are the frequency and bandwidth of the resonator,
respectively, and T is the period of the pulses. The amplitude of
the response in dB is shown on the vertical axis and the normalized
quantity BW·T on the horizontal axis. The functional relationship
between these two quantities is shown for six values of $2\pi FT$,
the argument of the cosine in Eq. (4.4). At BW·T = .5 there is a
change of vertical scale.

The open circles and dashed line on the graph illustrate how
it is used to obtain the response for a fixed resonator as the
pulse rate is varied. (Pulse rate is the reciprocal of T.) The
illustration is for F = 300 Hz and BW = 50 Hz. Each circle represents
a different pulse frequency. Scanning from left to right, the maxima
of response occur at 300 Hz, 150 Hz, and 100 Hz; the minima at 200 Hz



and 120 Hz. This curve is valid for all resonators having the
same Q, i.e., the same ratio of F to BW. However, the circles would
represent different pulse rates. Thus this curve shows the behavior
for F = 600 Hz and BW = 100 Hz, but all the pulse frequencies cited
should be doubled.
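
The worked example above can be reproduced numerically from
Eq. (4.4). The Python sketch below is illustrative only; it assumes
the usual relation between decay rate and bandwidth in Hz,
sigma = -pi * BW, and the function name is hypothetical.

```python
import math

def response_db(F, BW, pulse_rate):
    """Steady-state response |A/p|^2 of Eq. (4.4), in dB.

    F and BW are the resonator frequency and bandwidth in Hz.
    Assumes sigma = -pi * BW (an assumption of this sketch).
    """
    T = 1.0 / pulse_rate
    decay = math.exp(-math.pi * BW * T)              # e^(sigma * T)
    denom = 1 - 2 * decay * math.cos(2 * math.pi * F * T) + decay ** 2
    return -10 * math.log10(denom)

# Resonator of the illustration: F = 300 Hz, BW = 50 Hz.
for rate in (300, 200, 150, 120, 100):
    print(f"{rate:3d} pps: {response_db(300, 50, rate):+.2f} dB")
```

With these assumptions the response is highest at 300, 150 and 100
pulses per second and lowest at 200 and 120, and a resonator with
F = 600 Hz and BW = 100 Hz gives the same values at twice the pulse
rates.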

The extent of the response change from maximum to minimum can
be reduced if the driving pulses occur at intervals which are
alternately shorter and longer. This is discussed in the next section.

4.2  Alternate Pulsing of a Resonator

In commenting on the appreciable change in the response of a
resonator, we imply that perhaps the resulting maxima or minima
are undesirable features of our model of speech production which
actually do not exist because of some physical or neurological
mechanism in the actual human speech production system. We thus
are interested in simple models for reducing the height of the
maxima or depth of the minima. As will be shown in what follows,
the replacement of the constant period pulse source by one whose
pulses occur at alternately short and long intervals gives such a
reduction. The interest in such a model is increased as a result
of the observation that such alternations actually occur in human
speech (Lieberman, 1961; Smith, 1968). In the discussion that
follows we will treat alternation as a means of increasing the
response during what would otherwise be minima. Such a discussion
is based on a premise that optimal speech production is that with
the greatest amplitude. The alternative is that alternation works
to lower response maxima which also correspond to maxima of the
impedance presented to the larynx by the vocal tract. While we do
not orient our discussion to this latter case, all the same
principles apply and the same equations may be used to measure the
effect.

A  model  for  the  generation  of  alternating  pulses  is  shown  in 
Fig.  4.4.  A  pulse  generator,  operating  at  a  rate  equal  to  half  the 
number  of  pulses  per  second  we  desire,  drives  a  linear  system  whose 

FIG. 4-4  MODEL FOR GENERATION OF ALTERNATING PERIOD PULSES
[Block diagram: pitch generator (period 2T) driving a linear system
with impulse response $h(t) = u_0(t) + u_0[t - (T + \Delta)]$; the
output pulses occur at an average period T.]


output is two pulses for every pulse in. The pulses occur at the
average rate we desire and have inter-pulse intervals which
alternate between $(T+\Delta)$ and $(T-\Delta)$ seconds. The effect
of the alternation may be seen by considering how the frequency
components of the pulse generator are affected by the filter.

The frequency components occur at multiples of 1/2T, as is
indicated by the vertical lines in Fig. 4.5. The transfer function
of the dual pulse filter is

$$H(j\omega) = 1 + e^{-j\omega(T+\Delta)} \tag{4.7}$$

The magnitude of this transfer function is

$$|H(j\omega)| = 2\left|\cos\left[\frac{\omega}{2}(T+\Delta)\right]\right| \tag{4.8}$$

The effect of different values of $\Delta$ can be seen in Fig. 4.5,
where the magnitude of the transfer function is plotted for
$\Delta = 0$ and $\Delta = T/4$.
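
The magnitude of the dual pulse filter can be evaluated directly at
the components of the pulse generator. The Python sketch below is an
illustration under the assumption that the filter delays its second
pulse by T + Delta, so that its gain is 2|cos(omega (T + Delta)/2)|;
the function name and the 10 msec period are hypothetical.

```python
import math

def dual_pulse_gain(k, T, delta):
    """Gain of the dual pulse filter at the k-th multiple of 1/(2T).

    T is the average pulse period in seconds and delta the alternation.
    """
    omega = 2 * math.pi * k / (2 * T)          # k-th component, rad/s
    return 2 * abs(math.cos(omega * (T + delta) / 2))

T = 0.01  # hypothetical 100 pps average rate
for k in range(1, 7):
    print(k, round(dual_pulse_gain(k, T, 0.0), 3),
          round(dual_pulse_gain(k, T, T / 4), 3))
```

With no alternation the odd components are cancelled and the even
ones pass with a gain of 2. Under the same assumption, choosing
delta = T/k for an odd k brings the k-th component to the full
gain of 2.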

For $\Delta = 0$, the cosine function cancels all the odd
components. The resulting even harmonics are actually all the
harmonics of a pulse train of rate 1/T. This is actually the case
because, without the $\Delta$, we do have a constant period pulse
train with period T. For $\Delta = T/4$ we do, however, pass with
maximum magnitude one of the odd components while suppressing its
even neighbors. Thus if the peak of the resonance were at A in
Fig. 4.5 there would be no need to alternate the pulses. This
corresponds to the situation shown by the dashed lines in Fig. 4.1,
depicting a component occurring at the resonance peak. The situation
depicted by the solid lines in Fig. 4.1 corresponds to the resonance
peak occurring at B in Fig. 4.5, half way between components of the
average pulse frequency. In this case, a $\Delta$ of T/4 changes
what would otherwise be a minimum response condition to a maximum by
generating a maximum component at the resonance peak. For peaks
which occur at frequencies which are not multiples of 1/2T, the
maximum response can be obtained by finding the component which is
nearest to the peak


and maximizing it by the proper selection of $\Delta$. This component
is denoted as the kth component in the following formula. The
formula is


for  k  odd 
for  k  even 


where  k  =  integer 


value  of 


(4.9) 


and  F  is  the  frequency  of  the  peak  of  the  resonance. 

The value of $\Delta$ is approximately half the reciprocal of the
resonance frequency.


Theoretically,  one  could  operate  a  maximum  component  arbi¬ 
trarily  close  to  a  resonance  peak.  This  is  done  by  lowering  the 
rate  of  the  pulse  generator  in  Fig.  4.4  and  increasing  the  number 
of  pulses  in  the  impulse  response  of  the  filter  by  the  same  factor, 
to  keep  the  average  pulse  rate  the  same.  To  maximize  the  proper 
component  one  would  have  to  determine  the  correct  timing  for  every 
pulse  in  the  filter,  by  solving  sets  of  transcendental  equations. 
The complexity militates against the model being representative of
a  natural  process. 

The  complex  signal  representation  used  above  for  calculating 
the  response  to  truly  periodic  pulses  can  also  be  used  for  the 
alternation situation. Here, however, we have two amplitudes:
$A_1$ for the amplitude during the long period and $A_2$ for the
amplitude during the short. The formulas are best expressed as part
of a descriptive
table.  To  simplify  the  notation,  we  have  set  the  amplitude  of  the 
excitation  pulses  to  unity. 




Signal Amplitude                                        Instant of Time

$A_1$                                                   just after first pulse [t = 0 (mod 2T)]
$A_1 e^{ST+S\Delta}$                                    after long period
$A_2 = A_1 e^{ST+S\Delta} + 1$                          just after 2nd pulse [t = T + $\Delta$]
$A_2 e^{ST-S\Delta} = A_1 e^{2ST} + e^{ST-S\Delta}$     end of short period [t = 2T = 0 (mod 2T)]
$A_1 = A_1 e^{2ST} + e^{ST-S\Delta} + 1$                just after 1st pulse
                                                                                    (4.10)

where $S = \sigma + j\omega$.

From Eq. (4.10) we obtain the equation for $A_1$

$$A_1 = \frac{1 + e^{ST-S\Delta}}{1 - e^{2ST}} \tag{4.11}$$

and by analogy

$$A_2 = \frac{1 + e^{ST+S\Delta}}{1 - e^{2ST}} \tag{4.12}$$

We also note that for $\Delta = 0$

$$A_1 = A_2 = \frac{1 + e^{ST}}{1 - e^{2ST}} = \frac{1}{1 - e^{ST}} \tag{4.13}$$

which is Eq. (4.3) for constant period pulse excitation.
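
The two steady-state amplitudes are easily evaluated with complex
arithmetic. The Python sketch below is a minimal illustration, not
the calculation used for the figures of this report; it assumes unit
excitation pulses, sigma = -pi * BW for a bandwidth BW in Hz, and the
closed forms A1 = (1 + e^{S(T-Delta)})/(1 - e^{2ST}) and
A2 = (1 + e^{S(T+Delta)})/(1 - e^{2ST}). The function name and the
numerical values are hypothetical.

```python
import cmath
import math

def alternating_amplitudes(F, BW, T, delta):
    """Steady-state complex amplitudes (A1, A2) for alternated pulsing.

    F, BW: resonator frequency and bandwidth in Hz.
    T: average pulse period; delta: alternation, both in seconds.
    Assumes S = sigma + j*omega with sigma = -pi*BW (sketch assumption).
    """
    S = complex(-math.pi * BW, 2 * math.pi * F)
    denom = 1 - cmath.exp(2 * S * T)
    a1 = (1 + cmath.exp(S * (T - delta))) / denom   # start of long period
    a2 = (1 + cmath.exp(S * (T + delta))) / denom   # start of short period
    return a1, a2

# F = 300 Hz, BW = 50 Hz, pulsed at an average of 200 pps, which is a
# response minimum for constant period pulses.
F, BW, T = 300.0, 50.0, 1 / 200.0
a1, a2 = alternating_amplitudes(F, BW, T, 1 / (2 * F))
a0, _ = alternating_amplitudes(F, BW, T, 0.0)
print(abs(a1), abs(a2), abs(a0))
```

In this example, alternating the periods by half the reciprocal of
the resonance frequency raises both amplitudes well above the
constant period value.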


These expressions can now be used to derive some measure of
response based on the two different response amplitudes. The most
appropriate measure is probably the power averaged over the short
and long intervals. The results of this calculation cannot be
represented in simple graphical form as for the constant period
case, and the calculation is sufficiently complicated that it is
best done for specific values of resonance frequency and bandwidth.




Such a comparison of resonator response power for alternated
and constant period excitation is shown for three different
resonator frequencies in Figs. 4.6 through 4.8. The horizontal
axes show the average pulse frequency 1/T. On the average,
the resonator power increases at 6 dB/octave following the
input power from the constant amplitude excitation pulses. The
curves for no alternation ($\Delta = 0$) show the same type of
results given by House (1959). In determining the case for
alternated pitch, the amount of alternation, $\Delta$, was set to
half the reciprocal of the resonance frequency, rather than the
reciprocal of the pitch component nearest the resonance frequency,
as detailed above. As can be seen, this selection of $\Delta$ makes
the response to alternated pulses be 180° out of phase with the
response to constant period pulses. One has peaks where the other
has valleys and vice versa. Thus for any combination of resonator
and pitch frequencies, resonator response may be either maximized
or minimized by selection of the proper pitch mode: alternated
or constant period.




FIG.  4-6  RESPONSE  POWER  FOR  ALTERNATED  AND  CONSTANT 
PERIOD  PULSES  EXCITING  A  RESONATOR  OF 
F-CUOHz,  BW  =  50  Hz 




REFERENCES 


Dunn, H. K.: Methods of Measuring Vowel Formant Bandwidths.
J. Acoust. Soc. Am. 33, 1737-1746 (1961).

Fant, G.: Acoustic Theory of Speech Production. 's-Gravenhage:
Mouton & Co. 1960.

Flanagan, J. L.: A Difference Limen for Vowel Formant
Frequency. J. Acoust. Soc. Am. 27, 613-617 (1955).

Flanagan, J. L.: Speech Analysis, Synthesis and Perception.
New York: Academic Press, Inc. 1965.

Holmes, J. N.: An Investigation of the Volume Velocity Waveform
at the Larynx during Speech by Means of an Inverse Filter.
Proc. IV Int. Congress Acoust., Copenhagen, Denmark, August 1962.
Also Proc. Stockholm Speech Comm. Seminar, RIT, Stockholm, Sweden,
September 1962.

House, Arthur S.: A Note on Optimal Vocal Frequency. J. Speech
and Hearing Res. 2, 55-60 (1959).

Lieberman, P.: Perturbations in Vocal Pitch. J. Acoust. Soc.
Am. 33, 597-603 (1961).

Lindqvist, J.: Inverse Filtering - Instrumentation and Techniques.
STL-QPSR-4/1964, Speech Transmission Lab., Royal Inst. of Tech.,
Stockholm, 1-4 (1964).

Lindqvist, J.: Studies of the Voice Source by Means of Inverse
Filtering. STL-QPSR-2/1965, Speech Transmission Lab., Royal
Institute of Tech., Stockholm, 8-13 (1965).

Mathews, M. V., J. E. Miller, and E. E. David, Jr.: An Accurate
Estimate of the Glottal Waveshape. J. Acoust. Soc. Am. 33,
843(A) (1961).

Peterson, G. E., and H. L. Barney: Control Methods Used in a
Study of the Vowels. J. Acoust. Soc. Am. 24, 175-184 (1952).

Smith, C. P.: Private Communication (1968).

Stevens, K. N.: The Perception of Sounds Shaped by Resonance
Circuits. ScD Thesis, Massachusetts Institute of Technology,
Cambridge, Mass., 1952.




Appendix  A 

INSTRUCTION  MANUAL  FOR 
VOTIF  FILTERING  UNITS 


Prepared by:

Design  Automation,  Inc. 
d09  Massachusetts  Avenue 
Lexington,  Massachusetts  02173 


Prepared  for: 

SIGNATRON,  Inc. 

594  Marrett  Road 

Lexington,  Massachusetts  02173 


TABLE OF CONTENTS

Section No.   Title

1.0           Introduction
2.0           Null Filter Functional Description
2.1           Null Filter Specification Summary
    2.1.1         Controls
    2.1.2         Accuracy
    2.1.3         Impedance Levels
    2.1.4         Signal Levels
    2.1.5         Noise Level
    2.1.6         Test Points
    2.1.7         Power Drain
2.2           Null Filter Operating Instructions
2.3           Null Filter Circuit Design
2.4           Null Filter Measured Response
2.5           Null Filter Maintenance and Calibration
3.0           Resonance Filter Functional Description
3.1           Resonance Filter Specification Summary
    3.1.1         Controls
    3.1.2         Accuracy
    3.1.3         Impedance Levels
    3.1.4         Signal Levels
    3.1.5         Noise Level
    3.1.6         Test Points
    3.1.7         Power Drain
3.2           Resonance Filter Operating Instructions
3.3           Resonance Filter Circuit Design
3.4           Resonance Filter Measured Response
3.5           Resonance Filter Maintenance and Calibration


LIST OF ILLUSTRATIONS

Figure

1.   Tuning Range of Frequency and Bandwidth Control Settings
2.   Recommended Installation Arrangement
3.   Simplified Transfer-Function Diagram of Null Filter
4.   Null Filter Schematic Diagram
5.   Simplified Transfer-Function Diagram of Resonance Unit
6.   Resonance Filter Schematic Diagram

Table

1.   Measured Response at 1000 Hz Frequency and 20 Hz Bandwidth
     Settings
2.   Measured Noise Output with Effective DC Gain Set to Unity at
     Various Tuning Frequencies
3.   Resonance Filter Bandwidth Measurements


INSTRUCTION  MANUAL  FOR  FILTERING  INSTRUMENT 


1.0  Introduction 


This appendix describes the design and operation of the Null and
Resonance Filters of the VOTIF speech analyser. Operational instructions
are given for a composite filtering instrument which consists
of five Null Filters and one Resonance Filter connected in cascade.

The frequency and bandwidth of each of these filters may be set
independently over the tuning range shown in Figure 1. Each filter
operates independently of the other filters.

The instrument operates in 50°F to 125°F ambient temperature without
forced-air cooling, and operates from a standard 115 VAC 60-Hz
commercial power line. A two-section 19-inch rack-mounting frame
contains the instrument input and output BNC connector clusters, a
regulated dual-output power supply, and quick-disconnect quarter-turn
panel-mount fasteners for mounting all six filter units in the frame.

Shielded cables with BNC connectors are furnished for interconnection
of filter units. The power supply is an Acopian Model lj?.D70U rated
for dual 15V 700 mA operation.

Figure 2 shows an appropriate installation arrangement for the units.
Various factors discussed in subsequent sections affect the actual
arrangement used in any given analysis situation. In all situations
it is advisable to have the lowest noise units earliest in the chain
to minimize noise build-up. This noise build-up is a consequence
of the rising gain-frequency characteristic (12 dB/octave/null) of
the instrument. For the maintenance of highest output signal-to-noise
ratio, the null units should be adjusted so that the tuning frequencies
increase along the cascade with the first unit having the
lowest frequency setting. However, when the input signal is noisy, as
is often the case with speech signals, the reverse ordering may be more
advisable. While not keeping signal-to-noise ratio at a maximum, having
tuning frequencies decrease along the cascade will tend to minimize
noise levels at each stage of the cascade.

Because any imperfections of the signal source will be magnified by the
rising gain-frequency response characteristic of the instrument, it is
suggested that precautions be taken to minimize distortion, pickup and
noise in the input signal. Similarly, when the output of a sine-wave
signal generator is used as a test input signal, imperfections in the
signal generator output that are barely visible on an oscilloscope trace
will be magnified by the rising gain-frequency response of a Null Filter.
Many sine-wave signal generators (including the Hewlett-Packard Model 209A)
have small discontinuities at the sine-wave zero-crossings. These will be
accentuated in the Null Filter, resulting in narrow spikes at the
sine-wave zero-crossings. This effect is most easily seen at TF^ in the Null
Filter. Another imperfection of some signal generators is the presence
of random noise added to the signal after the output level control. When
the generator output is set to minimum, the output noise will still
remain. Thus, when testing the internally generated noise of the
instrument, the instrument input should be physically shorted to remove noise
which could be coming from the signal source.


[Figure 1. Tuning Range of Frequency and Bandwidth Control Settings]


2.0  Null  Filter  Functional  Description 

The  Null  Filter  has  a  target  transfer  function  which  represents  a 
second-order  anti-resonance  or  Null  Filter  with  unity  effective  DC 
gain,  and  is  given  by 


\[
H_I(s) = \frac{(s + b)^2 + a^2}{b^2 + a^2}
\]

The  filter  frequency  and  bandwidth  parameters,  a  and  b  respectively, 
are  independently  tunable  over  the  audio  frequency  range  by  means  of 
precision  dials  calibrated  in  Hertz  (cps) . 
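As a rough illustration of the target transfer function, the following Python sketch evaluates its magnitude on the imaginary axis. It assumes the relations a = 2π·FREQ and b = π·BW between the parameters and the dial readings in Hz (as used in the circuit design section); the function names are hypothetical and chosen only for this example.

```python
import math

def null_response(signal_hz, freq_hz, bw_hz):
    """Magnitude of the ideal Null Filter response at one signal frequency."""
    a = 2 * math.pi * freq_hz          # assumed: a = 2*pi*FREQ, rad/s
    b = math.pi * bw_hz                # assumed: b = pi*BW, rad/s
    s = 1j * 2 * math.pi * signal_hz   # evaluate on the j-omega axis
    return abs(((s + b) ** 2 + a ** 2) / (b ** 2 + a ** 2))

def to_db(ratio):
    return 20 * math.log10(ratio)

# Response at the null for FREQ = 1000 Hz, BW = 20 Hz (roughly BW/FREQ).
null_depth = null_response(1000, 1000, 20)

# Slope of the response over one octave well above the null.
slope_db = to_db(null_response(16000, 1000, 20)) - to_db(null_response(8000, 1000, 20))

print(f"response at null: {null_depth:.4f} ({to_db(null_depth):.1f} dB)")
print(f"slope above null: {slope_db:.1f} dB/octave")
```

The slope comes out close to 12 dB/octave, which is the rising gain-frequency characteristic the manual attributes to each Null Filter. The sketch covers only the ideal function, not the added low-pass rolloff.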

Modifications to the above transfer function incorporated into the
design comprise an 18 KHz low-pass filter for roll-off of overall
high-frequency response, roll-off of the $s^2$ term at 100 KHz, and
polarity inversion (negative sign) of the effective DC gain
(extrapolation of the low-frequency gain to DC).


2.1 Null Filter Specification Summary

2.1.1 Controls

IN-OUT Switch                  IN: Output BNC connected to Input BNC
                               OUT: Output BNC connected to filter
                               output

GAIN Control                   Adjusts overall gain through filter,
                               after setting FREQ

BW Control and Range Switch    LOW range: 100 Hz/turn, up to 1000 Hz
                               HIGH range: 1 KHz/turn, up to 5 KHz
                               Limits: As defined in Fig. 1

FREQ Control and Range Switch  LOW range: 100 Hz/turn, up to 1000 Hz
                               HIGH range: 1 KHz/turn, up to 5 KHz
                               Limits: As defined in Fig. 1

2.1.2 Accuracy

FREQ Dial                      Adjustment precision: ± 0.5% of value
                               Calibration accuracy: ± 2% of value

BW Dial                        Adjustment precision: ± 0.5% of FREQ
                               for FREQ > 100 Hz min., otherwise
                               ± 0.5 Hz
                               Calibration accuracy: ± 10% of value

Transfer Function              Signal operating range: 20 Hz to 10 KHz
                               Relative amplitude: ± 0.25 dB (± 2.9%)
                               Delay variation: ± 0.10 msec

2.1.3 Impedance Levels

Input                          2.2 kilohms ± 5%, capacitor-coupled
Output                         2 ohms typical
Rated Load                     2 kilohms minimum impedance


2.1.4 Signal Levels

Output                         Up to ± 10V peak into 2 kilohms minimum
                               load impedance, for sine-wave
                               signals of > 200 Hz on LOW FREQ and
                               > 2 KHz on HIGH FREQ. Below these
                               frequencies, maximum output is determined
                               by internal signal level at TP5 or TP6,
                               and is a function of FREQ and BW control
                               settings.

Input                          Up to value causing maximum output;
                               varies with GAIN, FREQ and BW settings
                               and input frequency. The proper input
                               signal level and GAIN setting are
                               discussed in Section 2.2.

2.1.5 Noise Level

At least 40 dB below 7 Vrms at output; improves with increasing FREQ
setting.

2.1.6 Test Points

All test points are isolated by resistors of 680 or 1000 ohms to prevent
damage in case of accidental shorting of a test point to ground. The
test points are:

TP1     Input connector
TP2     Spare
TP3     Differentiator channel output
TP4     Bandwidth channel output
TP5     Summing amplifier output (unfiltered)
TP6     Input amplifier output
TP7     +15V supply
TP8     -15V supply
TP9     Frequency channel output
TP10    Output connector

2.1.7 Power Drain

No-signal                      69 mA at +15V, -72 mA at -15V
Normal signals                 89 mA at +15V, -92 mA at -15V


2.2 Null Filter Operating Instructions

An appropriate installation arrangement for the Null Filter is shown in
Figure 2. Each filter mounts and dismounts by means of quarter-turn panel
fasteners, and is connected by means of BNC signal input and output
connectors and a multi-pin power connector in the rear.

Front-panel control functions, dial calibrations and operating limits,
and test-point functions are listed in the Specification Summary. After
the FREQ and BW dials have been set, the GAIN may be set as high as the
value that gives unity effective DC gain. This value is obtained when
the output amplitude of low-frequency signals (20 Hz) is unaffected by
IN-OUT Switch operation.

If the GAIN setting or input signal level is too high, saturation or
other distortion may occur. If the input signal level is too low,
signal-to-noise ratio may be reduced. Distortion conditions are best
monitored at TP5, which precedes a low-pass filter followed by an output
amplifier having a gain of ten. Signal and noise amplitudes are best
monitored at TP10, which is connected to the output.

Choice of control settings should take account of signal-to-noise ratio,
because in a cascade of Null Filter units the steeply rising
gain-frequency characteristic (12 dB/octave per Null Filter) introduces
significant noise gain and bandwidth. This rise reaches a peak at 18 KHz,
where the low-pass filter in each Null Filter begins to roll off. In
particular, it is recommended that the Null Filter GAIN controls be set
at substantially less than unity effective DC gain (value discussed
below). This will help to keep the high frequency noise of the first
unit still moderately small at the output of the last unit. The noise
gain and signal gain depend on FREQ and BW settings in all of the filter
units.

To find a more desirable GAIN setting, let us assume that the 18 KHz
noise content of the output of the first Null Filter is 5 mVrms. This
passes through four Null Filters, one of which is approximately balanced
out at 18 KHz by the Resonance Filter, leaving three effective stages of
gain. Let us also assume that the final 18 KHz noise output should not
exceed 1 Vrms. Then the 18 KHz gain of each Null Filter should be
$\sqrt[3]{1/0.005} = 5.8$. This corresponds to unity gain at
$18\sqrt{0.75/5.8} = 6.5$ KHz. The effective DC gain will be
approximately $(\mathrm{FREQ}/6.5\ \mathrm{KHz})^2$, which is below unity
by an amount dependent upon the FREQ setting. Thus the GAIN control can
simply be set to obtain unity gain through each Null Filter at 6.5 KHz
input signal frequency.
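The arithmetic of this GAIN-setting example can be retraced with a short Python sketch. It assumes three effective gain stages (four Null Filters with one balanced out by the Resonance Filter) and takes the 0.75 factor as read from the printed expression; the variable names are illustrative only.

```python
import math

noise_in_vrms = 0.005        # assumed 18 KHz noise at the first Null Filter output
noise_out_limit_vrms = 1.0   # assumed ceiling on the final 18 KHz noise output
effective_stages = 3         # four Null Filters, one balanced by the Resonance Filter

# Allowed 18 KHz gain per Null Filter: cube root of the total allowed gain.
gain_per_stage = (noise_out_limit_vrms / noise_in_vrms) ** (1 / effective_stages)

# The manual rounds the gain to 5.8 before the next step; the 0.75 factor
# is taken as read from the source expression.
unity_gain_khz = 18.0 * math.sqrt(0.75 / round(gain_per_stage, 1))

def effective_dc_gain(freq_setting_hz):
    """Approximate effective DC gain, (FREQ / unity-gain frequency)^2."""
    return (freq_setting_hz / (unity_gain_khz * 1000.0)) ** 2

print(f"18 KHz gain per stage: {gain_per_stage:.2f}")
print(f"unity-gain frequency:  {unity_gain_khz:.2f} KHz")
print(f"effective DC gain at FREQ = 1000 Hz: {effective_dc_gain(1000):.4f}")
```

Changing the assumed noise figures shows how quickly the permissible per-stage gain shrinks as more Null Filters are cascaded.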




2.3 Null Filter Circuit Design

Figure 3 is a simplified transfer-function diagram of the Null Filter.
For non-inverting input signals, the gain of an operational amplifier
is larger by unity than the gain for inverting inputs. This fact is
accounted for in Stage 4A, where both inputs are used, by means of the
attenuation factor shown at the inverting input.


The  simplified  overall  transfer  function  resulting  from  Figure  3  is  as 
follows : 


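\[
\begin{aligned}
H(s) &= -G_1 K_5 \left[ T^2 s^2 + T (K_{4A} - 1)\, y K_{4B}\, s + y^2 K_{4A} K_{4B} + x^2 K_3^2 \right] \\
     &= -G_1 K_5 T^2 \left[ s^2 + \frac{(K_{4A} - 1) K_{4B}\, y}{T}\, s + \frac{K_{4A} K_{4B}\, y^2}{T^2} + \frac{K_3^2 x^2}{T^2} \right]
\end{aligned}
\]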


We wish to realize the ideal transfer function:

\[
H_I(s) = \frac{s^2 + 2bs + b^2 + a^2}{b^2 + a^2}
\]

Let us define x and y as potentiometer transmissions (maximum = unity),
f and BW as the dial readings in Hz, and F as the full-scale dial
calibration of 10 kHz for both dials. We then have these relationships to
be satisfied:

\[
\begin{aligned}
a   &= 2\pi f = 2\pi x F \\
b   &= \pi\,\mathrm{BW} = \pi y F \\
2b  &= (K_{4A} - 1) K_{4B}\, y / T = 2\pi y F \\
b^2 &= K_{4A} K_{4B}\, y^2 / T^2 = \pi^2 y^2 F^2 \\
a^2 &= K_3^2 x^2 / T^2 = (2\pi x F)^2
\end{aligned}
\]

The last three equalities yield the design constraints:

\[
\begin{aligned}
(K_{4A} - 1) K_{4B} &= 2\pi F T \\
(K_{4A} - 1) / K_{4A} &= 2 / (\pi F T) = 1 - 1/K_{4A} \\
K_3 &= 2\pi F T
\end{aligned}
\]


In this design the unity-gain frequency of the differentiator stages has
been set to 2 kHz. This leads to the following design values:

\[
\begin{aligned}
T      &= 1 / (2\pi \cdot 2\ \mathrm{kHz}) = 79.6\ \mu\mathrm{sec} \\
K_3    &= 5.0 \\
K_{4A} &= 5.0 \\
K_{4B} &= 1.25
\end{aligned}
\]
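The design values can be cross-checked against the design constraints with a few lines of Python. This is a sketch under one reading of the partly illegible gain symbols, here called K3, K4A and K4B: it assumes the constraints (K4A - 1)·K4B = 2πFT, K4A·K4B = (πFT)² and K3 = 2πFT, with F = 10 kHz and a 2 kHz differentiator unity-gain frequency.

```python
import math

F = 10e3          # full-scale dial calibration, Hz
f_unity = 2e3     # differentiator unity-gain frequency, Hz

T = 1 / (2 * math.pi * f_unity)      # differentiator time constant, seconds

p = 2 * math.pi * F * T              # required value of (K4A - 1) * K4B
q = (math.pi * F * T) ** 2           # required value of K4A * K4B (assumed form)

K3 = p                               # from K3 = 2*pi*F*T
K4B = q - p                          # subtracting the two constraints leaves K4B
K4A = q / K4B

print(f"T   = {T * 1e6:.1f} microseconds")
print(f"K3  = {K3:.2f}")
print(f"K4A = {K4A:.2f}")
print(f"K4B = {K4B:.2f}")
```

Under these assumptions the solved gains agree with the listed values of 5.0, 5.0 and 1.25.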


The variable gain control permits the gain factor $-G_1 K_5 T^2$ to be
adjusted to meet the design requirement of unity effective DC gain. The
gain $G_1$ would normally be varied inversely with $(a^2 + b^2)$. This
factor can reach 5,000:1, which would use up much of the dynamic range
available between noise and saturation levels if straightforward range
switching were used. This potential difficulty is largely avoided by the
indirect method used for range switching. Ten-to-one range switching for
both variables a and b is accomplished by scaling all other factors in
the opposite direction. This is shown in the Circuit Schematic (Fig. 4).
The effective DC gain is made insensitive to the Frequency Range Switch
position. When the Frequency dial is maintained at one turn minimum by
means of frequency range switching, the variation in $G_1$ is reduced to
only 200:1. This permits reasonable signal-to-noise performance and,
together with a logarithmic infinite-resolution potentiometer, aids manual
gain adjustment.

Design factors which modify the transfer function above 10 kHz are the
introduction of high-frequency rolloff in the differentiators and in the
overall gain function. These rolloffs contribute to differentiator
stability and to overall signal-to-noise ratio.

The $s^2$ term in the ideal transfer function corresponds to a
gain-frequency asymptote rising at 12 dB/octave at the upper end of the
operating signal frequency range (10 kHz). Above this point the frequency
response must be rolled back to a falling asymptote for reasons of
physical realizability, noise bandwidth limitation, and to maintain
stability even in the presence of stray coupling.

Each differentiator stage has a pair of real poles at 100 kHz, producing
only -0.1 dB and -12° at 10 kHz. The primary rolloff for the entire
filter transfer function is provided by a fourth-order Butterworth
low-pass filter at the output. With an 18-kHz cutoff frequency, the filter
introduces only -0.1 dB with -87° at 10 kHz. Above its cutoff frequency,
the fourth-order filter overrides the double differentiator, producing
a net rolloff of 12 dB/octave up to 100 kHz. Beyond 100 kHz, each
differentiator becomes -6 dB/octave instead of +6 dB/octave. The net
rolloff beyond 100 kHz thus becomes 36 dB/octave.
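The quoted gain and phase figures at 10 kHz can be approximated with a small Python sketch. It assumes an ideal double real pole at 100 kHz and an ideal fourth-order Butterworth response with an 18 kHz cutoff; the actual circuit may differ in detail, and the function names are hypothetical.

```python
import cmath
import math

def double_real_pole(f_hz, pole_hz=100e3):
    """Rolloff contributed by a pair of coincident real poles."""
    return 1 / (1 + 1j * f_hz / pole_hz) ** 2

def butterworth4(f_hz, cutoff_hz=18e3):
    """Ideal fourth-order Butterworth low-pass response (unity DC gain)."""
    s = 1j * f_hz / cutoff_hz                 # normalized complex frequency
    h = 1.0
    for k in range(4):
        # Left-half-plane poles spaced evenly on the unit circle.
        pole = cmath.exp(1j * math.pi * (2 * k + 5) / 8)
        h /= (s - pole)
    return h

def db_and_degrees(h):
    return 20 * math.log10(abs(h)), math.degrees(cmath.phase(h))

pole_db, pole_deg = db_and_degrees(double_real_pole(10e3))
lpf_db, lpf_deg = db_and_degrees(butterworth4(10e3))

print(f"double pole at 10 kHz:  {pole_db:.2f} dB, {pole_deg:.1f} degrees")
print(f"Butterworth at 10 kHz:  {lpf_db:.2f} dB, {lpf_deg:.1f} degrees")
```

Both idealized results land close to the rounded figures in the text: a small fraction of a dB of loss, with roughly -11° and -88° of phase respectively.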

Maximum overall gain occurs at the filter cutoff frequency, but does
not exceed 21^,000 over the entire range of dial settings. A net
low-frequency gain inversion is utilized to make overall stability more
insensitive to coupling from output to input. Stray coupling is minimized
by physical separation and shielding of input and output leads, and by
multiple bypassing and divided routing of power-supply lines.


[Figure 4. Null Filter Schematic Diagram]


Each of the amplifier stages has a compensation network and rolloff
feedback capacitor selected for accurate response to signal frequencies
and effective discrimination against higher (noise) frequencies.

Emitter followers returned to current sources are used at two
interstage locations for driving heavy loads with minimum amplifier
crossover distortion.

The input amplifier is selected for low noise and is operated at low
impedance levels to minimize the voltage output caused by the input
current noise.

2.4 Null Filter Measured Response

The results of response measurements taken on Null Filter #1 on
Nov. 1, 1968 are shown in Table 1. The frequency and bandwidth settings
were 1000 Hz and 20 Hz, respectively, both on their low ranges.
Measurements of both input and output voltage were made using a stable
wideband full-wave operational rectifier feeding a Digitec DC digital
voltmeter via a low-pass filter. Signal frequency of the Hewlett-Packard
Model 209A oscillator was monitored with a Hewlett-Packard 9 12 frequency
counter. The effective DC gain of the Null Filter was set close to
unity, and the input or output, whichever was larger at each signal
frequency, was set just below 7 V rms.

The measured null frequency was 1007 Hz, or 0.7% high, well within the
± 2% frequency calibration requirement. The ideal response data for
use in Table 1 was computed for f = 1006 Hz and BW = 19 Hz for
comparison with the actual frequency response.

The measured values were corrected for rectifier offset due to zero
error and noise, and for rectifier amplitude non-linearity using a
calibration curve. The ideal response was normalized to the measured
low-frequency gain to eliminate the effect of the slight difference from
unity in the effective DC gain.




Table 1. Measured Response at 1000 Hz Frequency and 20 Hz Bandwidth
Dial Settings.

Signal       Measured    Ideal Response H_I    Ratio      Error
Frequency    Response    (f = 1006 Hz,         H_M/H_I    dB
f_S, Hz      H_M         BW = 19 Hz)

40           0.99842     0.99842               1.000       0
700          0.52151     0.51604               1.011       0.10
741          0.46260     0.45771               1.011       0.10
823          0.33537     0.33114               1.014       0.12
864          0.26570     0.26295               1.010       0.09
935.7        0.14087     0.13609               1.035       0.30
966.5        0.08297     0.07917               1.048       0.41
993          0.03280     0.03180               1.031       0.27
1007         0.01704     0.01900               0.897      -0.94
1019         0.03280     0.03222               1.015       0.16
1053         0.11050     0.10780               1.025       0.21
1200         0.42799     0.42335               1.011       0.10
1452         1.0955      1.0834                1.011       0.10
1757         2.0635      2.0503                1.006       0.05
2572         5.5408      5.5362                1.001       0.01
4143         15.598      15.959                0.977      -0.20
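The ideal-response column of Table 1 can be approximately regenerated in Python. The sketch assumes a = 2πf and b = π·BW with f = 1006 Hz and BW = 19 Hz, and uses a handful of measured values as they are read from the table; it is an illustration of the comparison, not the original computation.

```python
import math

def ideal_null_response(signal_hz, freq_hz=1006.0, bw_hz=19.0):
    """Ideal Null Filter magnitude, assuming a = 2*pi*f and b = pi*BW."""
    a = 2 * math.pi * freq_hz
    b = math.pi * bw_hz
    s = 1j * 2 * math.pi * signal_hz
    return abs(((s + b) ** 2 + a ** 2) / (b ** 2 + a ** 2))

# (signal frequency in Hz, measured response) pairs as read from Table 1.
measured_rows = [(40, 0.99842), (700, 0.52151), (1007, 0.01704),
                 (2572, 5.5408), (4143, 15.598)]

results = []
for f_hz, measured in measured_rows:
    ideal = ideal_null_response(f_hz)
    ratio = measured / ideal
    error_db = 20 * math.log10(ratio)
    results.append((f_hz, ideal, ratio, error_db))
    print(f"{f_hz:7.1f} Hz  ideal={ideal:.5f}  ratio={ratio:.3f}  error={error_db:+.2f} dB")
```

The recomputed ideal values agree with the tabulated ones to the printed precision, and the largest discrepancy between measured and ideal response falls at the 1007 Hz null.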


The results in Table 1 show that the relative amplitude limit of ± 0.25 dB
is met at all the test frequencies except at and near the null, where the
response is down 20 to 35 dB. The largest error occurs right at the null.
It is believed that these errors are caused primarily by measuring
instrument non-linearity and zero offset, which are large enough to
require a more accurate linearity calibration of the rectifier, together
with a rectifier range switching arrangement, to resolve definitely the
cause for the apparent disagreement between measured and ideal responses
near the deep null. The response was deemed to be close enough to the
ideal not to warrant development of more precise instrumentation.

Bandwidth dial calibration was checked by taking measurements with settings
of 1000 Hz frequency and 200 Hz bandwidth (Q = 5). With unity nominal
effective DC gain, the measured response was 0.1988 at 1000 Hz and 0.9925
at 100 Hz, giving a ratio of 0.2003. This is within 0.1% of the ideal
ratio 0.1983/0.9903 = 0.2002, or two orders of magnitude better than the
± 10% bandwidth calibration specification.
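The ideal ratio used in this bandwidth check follows from the target transfer function. A minimal Python sketch, again assuming a = 2π·FREQ and b = π·BW and comparing against the measured ratio of 0.2003 quoted in the text:

```python
import math

def ideal_null_response(signal_hz, freq_hz, bw_hz):
    """Ideal Null Filter magnitude, assuming a = 2*pi*FREQ and b = pi*BW."""
    a = 2 * math.pi * freq_hz
    b = math.pi * bw_hz
    s = 1j * 2 * math.pi * signal_hz
    return abs(((s + b) ** 2 + a ** 2) / (b ** 2 + a ** 2))

at_null = ideal_null_response(1000, 1000, 200)      # response at the tuned frequency
at_low = ideal_null_response(100, 1000, 200)        # response at 100 Hz
ideal_ratio = at_null / at_low

measured_ratio = 0.2003                              # ratio quoted in the text
deviation_percent = 100 * (measured_ratio - ideal_ratio) / ideal_ratio

print(f"ideal ratio: {ideal_ratio:.4f}")
print(f"deviation of measured ratio: {deviation_percent:.2f} %")
```

Because the response at the null is nearly proportional to BW/FREQ, a bandwidth dial error would show up almost one-for-one in this ratio, which is why the ratio is a convenient calibration check.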

Noise output measurements taken on the same date are shown in Table 2.
The effective DC gain was set to unity for each tuning frequency, and
the bandwidth was set at zero. The input was shorted, representing low
impedance of the input signal source. The output readings were corrected
for rectifier zero offset and converted to rms values. The noise output
is highest at the lowest tuning frequency, where the transfer function
response up to and including 18 kHz is largest. Using the maximum
available output signal of 7 V rms as a reference, the signal-to-noise
ratio is 51 dB or better, substantially better than the required 40 dB.




Table 2. Measured Noise Output with Effective DC Gain Set to Unity
at Various Tuning Frequencies.

Frequency     Frequency    Noise Output    Level Referred
Setting f     Range        Vrms            to 7 Vrms
Hz                                         dB

100           low          0.020           -51
200           low          0.00#           -62
1000          high         0.0022          -70
2000          high         0.006           -61
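The last column of Table 2 is the noise voltage expressed in dB relative to the 7 V rms reference. A short Python sketch of that conversion, using only the rows whose noise voltage is fully legible in the source:

```python
import math

REFERENCE_VRMS = 7.0   # maximum available output signal used as reference

def level_db(noise_vrms):
    """Noise level in dB relative to the 7 V rms reference."""
    return 20 * math.log10(noise_vrms / REFERENCE_VRMS)

# (frequency setting in Hz, noise output in V rms) for the legible rows.
legible_rows = [(100, 0.020), (1000, 0.0022), (2000, 0.006)]

levels = {f_hz: level_db(v) for f_hz, v in legible_rows}
for f_hz, db in levels.items():
    print(f"{f_hz:5d} Hz: {db:.1f} dB")
```

The worst case, at the 100 Hz setting, corresponds to the 51 dB signal-to-noise figure quoted in the text.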


2.5 Null Filter Maintenance and Calibration

Stability of performance of the Null Filter is safeguarded by means of
adequate design margins and frequency compensation techniques, careful
component and wiring layout and shielding, and the use of stable
metal-film resistors and trim potentiometers. Critical capacitors are
stable low-loss mica types, and the input amplifier is a selected
low-noise 709C.

Should it be necessary to replace any components, consideration should
be given, after the repair is completed, as to whether the gain of a
critical stage (and therefore the overall calibration) might be affected.
This applies primarily to resistors connected to the input terminals of
amplifiers preceding the three-input summing amplifier. Examination of
the Factory Calibration Procedure below should enable determining which,
if any, calibration steps are affected.

Recalibration due to aging or drift should not be necessary for at least
a year. A simple way to verify stability is to check null frequency at
several points at near-zero bandwidth, using a signal generator and a
frequency counter.

Below is the Factory Calibration Procedure, which utilizes a DC digital
voltmeter to set gain and attenuation ratios within 0.2% accuracy. Refer
to the schematic of Figure 4.

Factory  Calibration  Procedure 


1.  Check alignment of electrical zero of each section of FREQ and
    BW pots to dial zero, using an ohmmeter.

2.  Set trimmer #1 to obtain gain = -4 from 2B output to 4A output.
    Set BW = 0, and obtain 2 VDC at 2B output by means of GAIN pot
    and jumpers connecting 47 uF negative end to -15V and hi k ohm
    across 1000 pF feeding 2B.

3.  Set trimmer #2 to obtain 10:1 ratio at BW pot 1H terminal with
    BW Range switching. Use BW = 0, and 10 VDC at output of 709C
    stage.

4.  Set trimmer #3 to obtain 10:1 ratio at BW pot 2H terminal with
    BW Range switching. Use BW = 0, and 10 VDC at output of stage 4A.

5.  Set trimmer #4 to obtain gain = 5/4 through stage 4B. Use BW =
    approximately 7000 (high range) and adjust GAIN to obtain 8 VDC
    at + input of stage 4B. Set FREQ = 0 (high range).

6.  Pad 10K 1% resistor at output of stage 4B to obtain 100:1 ratio
    at arm 2 of FREQ RANGE switch between high and low positions.
    Use 10 VDC at 4B output, and check that grounding - input of
    stage 5A has no effect.

7.  Pad 10K 1% resistor at output of stage 2A to obtain 100:1
    ratio at arm 1 of FREQ RANGE switch, as above.

8.  Check for unity gain through stages 2B and 2A at 2000 Hz input
    frequency, and trim 78.7K resistor or 1000 pF capacitor if
    necessary.

9.  Set trimmer #5 for best null at FREQ = 1000 Hz (low range),
    BW = 0 (low range), and with 1000 Hz input signal. Check FREQ
    scale reading for best null at 500 Hz input.

10. Adjust variable capacitor at stage 3A for best null at FREQ =
    5 kHz (high range), BW = 0 (low range) and 5 kHz input. Check
    FREQ scale reading for 2 kHz and 1 kHz input signals.

3.0  Resonance  Filter  Functional  Description 

The  Resonance  Filter  has  a  target  transfer  function  which  is  the  inverse 
of  the  Null  Filter  target  transfer  function.  It  is  given  by 


\[
H_I(s) = \frac{b^2 + a^2}{(s + b)^2 + a^2}
\]


The  Filter  frequency  and  bandwidth  parameters,  a  and  b  respectively, 
are  independently  tunable  over  the  audio  frequency  range  by  means  of 
precision  dials  calibrated  in  Hertz  (cps). 

The  only  modification  to  the  above  transfer  function  included  in  the 
design  is  the  inverted  polarity  (negative  sign)  of  the  effective  DC  gain. 
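Since the Resonance Filter target is the inverse of the Null Filter target, a resonance and a null set to the same FREQ and BW should cancel exactly in the ideal case. The Python sketch below illustrates this, assuming a = 2π·FREQ and b = π·BW; the function names are hypothetical.

```python
import math

def _params(freq_hz, bw_hz):
    # Assumed dial relations: a = 2*pi*FREQ, b = pi*BW (both in rad/s).
    return 2 * math.pi * freq_hz, math.pi * bw_hz

def null_tf(signal_hz, freq_hz, bw_hz):
    a, b = _params(freq_hz, bw_hz)
    s = 1j * 2 * math.pi * signal_hz
    return ((s + b) ** 2 + a ** 2) / (b ** 2 + a ** 2)

def resonance_tf(signal_hz, freq_hz, bw_hz):
    a, b = _params(freq_hz, bw_hz)
    s = 1j * 2 * math.pi * signal_hz
    return (b ** 2 + a ** 2) / ((s + b) ** 2 + a ** 2)

# Gain at resonance for FREQ = 1000 Hz, BW = 200 Hz is roughly FREQ/BW.
peak_gain = abs(resonance_tf(1000, 1000, 200))

# A null and a resonance with identical settings cancel at any frequency.
product = null_tf(18000, 1000, 200) * resonance_tf(18000, 1000, 200)

print(f"gain at resonance: {peak_gain:.2f}")
print(f"null x resonance at 18 kHz: {abs(product):.6f}")
```

The sign inversion built into both real units is ignored here; it does not affect the magnitudes.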


3.1 Resonance Filter Specification Summary

3.1.1 Controls

IN-OUT Switch                  IN: Output BNC connected to Input BNC
                               OUT: Output BNC connected to filter
                               output

GAIN Control                   Adjusts overall gain through filter,
                               after setting FREQ

BW Control and Range Switch    LOW range: 100 Hz/turn, up to 1000 Hz
                               HIGH range: 1 KHz/turn, up to 5 KHz
                               Limits: As defined in Fig. 1

FREQ Control and Range Switch  LOW range: 100 Hz/turn, up to 1000 Hz
                               HIGH range: 1 KHz/turn, up to 5 KHz
                               Limits: As defined in Fig. 1

3.1.2 Accuracy

FREQ Dial                      Adjustment precision: ± 0.3% of value
                               Calibration accuracy: ± 2% of value

BW Dial                        Adjustment precision: ± 0.5% of FREQ
                               for FREQ > 100 Hz min., otherwise
                               ± 0.5 Hz
                               Calibration accuracy: ± 10% of value

Transfer Function              Signal operating range: 20 Hz to 10 KHz
                               Relative amplitude: ± 0.25 dB (± 2.9%)
                               Delay variation: ± 0.10 msec

3.1.3 Impedance Levels

Input                          3 to 10 kilohms, capacitor-coupled
Output                         2 ohms typical
Rated Load                     2 kilohms minimum impedance

3.1.4 Signal Levels

Output                         Up to ± 10V peak into 2 kilohms minimum
                               load impedance. At some control
                               settings, maximum output is determined
                               by internal signal levels, by the
                               requirement of keeping internal levels at
                               or below ± 10V.

Input                          Up to value causing distortion at TP5;
                               varies with GAIN setting.

3.1.5 Noise Level

At least 40 dB below 7 Vrms at output


3.1.6 Test Points

All test points are isolated by resistors of 680 or 1000 ohms to prevent
damage in case of accidental shorting of a test point to ground. The
test points are:

TP1     Input connector
TP2     Spare
TP3     Spare
TP4     Spare
TP5     Frequency feedback channel output
TP6     Spare
TP7     +15V Supply
TP8     -15V Supply
TP9     Spare
TP10    Output connector

3.1.7 Power Drain

No-signal                      22 mA at +15V, -22 mA at -15V
Normal signals                 41 mA at +15V, -41 mA at -15V

3.2  Resonance  Filter  Operating  Instructions 

An appropriate mounting location for the Resonance Filter is shown in
Figure 2. Filter mounting and connection are the same as described
for the Null Filter.

Front-panel control functions, dial calibrations and operating limits
are the same as for the Null Filter. They are listed in the
Specification Summary together with other parameters of the Resonance
Filter.

It is recommended that the GAIN control be set for unity effective DC
gain. The considerations which affect the choice of Null Filter GAIN
control setting, discussed in Section 2.2, need not be considered here
because the gain of the Resonance Filter falls at frequencies beyond
resonance rather than rising like the Null Filter. Thus noise is
attenuated, rather than amplified, and the GAIN control can be set for
unity effective DC gain.

Test point TP5 is provided to aid in detecting saturation or distortion
conditions due to excessive input signal level. Both TP5, the output of
a limiting amplifier, and TP10, the Filter output, should be monitored
for this purpose. Proper signal-to-noise ratio resulting from adequate
input signal level would be observed at TP10.




3.3  Resonance  Filter  Circuit  Design 

Figure 5 shows a simplified transfer-function diagram of the
Resonance Filter, which uses feedback combined with feed-forward
through two integrating amplifiers.

The net input to the summing amplifier can be expressed as follows:

\[
e_s = G_1 E_{in} + K_B y \left( K_1 y + C_I T_2 s \right) E_{I2} + C_F K_F K_1 x^2 E_{I2}
\]

This will represent the ideal transfer function:

\[
H_I(s) = \frac{b^2 + a^2}{s^2 + 2bs + b^2 + a^2}
\]

Using the same definitions as Section 2.3, we obtain the following
relationships:

\[
\begin{aligned}
a   &= 2\pi f = 2\pi x F \\
b   &= \pi y F \\
2b  &= K_S K_B C_I\, y / T_1 = 2\pi y F \\
b^2 &= K_S K_1 K_B\, y^2 / (T_1 T_2) = \pi^2 y^2 F^2 \\
a^2 &= K_S C_F K_1 K_F\, x^2 / (T_1 T_2) = 4\pi^2 x^2 F^2
\end{aligned}
\]


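From the last three equalities we obtain the following design
constraints:

\[
\begin{aligned}
K_S K_B C_I / T_1 &= 2\pi F \\
K_1 / (C_I T_2) &= \pi F / 2 \\
K_S C_F K_1 K_F / (T_1 T_2) &= 4\pi^2 F^2
\end{aligned}
\]

For this design we utilize F = 10 KHz, $1/(2\pi T_2)$ = 1 KHz,
$T_1 = 2T_2$, and $C_I = 2$. From these values we obtain:

\[
\begin{aligned}
K_S K_B &= 10 \\
K_1 &= 5 \\
K_S C_F K_F &= 40
\end{aligned}
\]

The remaining design values selected are $K_S$ = k*5b and $K_S C_F$ = 2/3.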
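The gain products implied by the design constraints can be recomputed in Python. This sketch rests on one reading of the partly illegible constraints, namely K_S·K_B·C_I/T_1 = 2πF, K_1/(C_I·T_2) = πF/2 and K_S·C_F·K_1·K_F/(T_1·T_2) = 4π²F², with F = 10 KHz, 1/(2πT_2) = 1 KHz, T_1 = 2T_2 and C_I = 2. The variable names are illustrative.

```python
import math

F = 10e3                            # full-scale dial calibration, Hz
T2 = 1 / (2 * math.pi * 1e3)        # from 1/(2*pi*T2) = 1 KHz
T1 = 2 * T2
C_I = 2.0

# First constraint: Ks*Kb*C_I / T1 = 2*pi*F
ks_kb = 2 * math.pi * F * T1 / C_I

# Second constraint: K1 / (C_I*T2) = pi*F/2
k1 = math.pi * F * C_I * T2 / 2

# Third constraint: Ks*Cf*K1*Kf / (T1*T2) = 4*pi^2*F^2
ks_cf_kf = 4 * math.pi ** 2 * F ** 2 * T1 * T2 / k1

print(f"Ks*Kb    = {ks_kb:.2f}")
print(f"K1       = {k1:.2f}")
print(f"Ks*Cf*Kf = {ks_cf_kf:.2f}")
```

With the stated choice Ks·Cf = 2/3, the third product would put the frequency-channel gain K_F at about 60 under these assumptions.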

Now that the dynamics of the transfer function are accounted for, by
realizing the terms of the denominator, the numerator may be
considered and thus the range of gain $G_1$ required.

If the output were taken at $E_{I2}$ the configuration of Figure 5 would
be simpler. One would then expect $G_1$ to track the main feedback gain
$C_F K_F K_1 x^2$ in order to approach unity closed-loop low-frequency
gain.

Because $x^2$ varies over a 2500:1 range, the feedback gain is selected
to vary both above and below unity in order that the maximum allowable
signal at $e_s$ and $E_{I2}$ should not be unduly restricted. This limits
the reduction of signal-to-noise ratio occurring at the extreme f dial
settings. Selection of the output configuration shown strikes a balance
between good overall dynamic range, close coupling to the common $E_{I2}$
signal, and reduction of the necessary range of the gain control.

The  gain  control  uses  an  infinite-resolution  logarithmic  potentiometer 
for  fine  manual  adjustability.  It  is  passively  connected  to  the  input 
such  that  the  input  impedance  varies  over  the  restricted  range  of  3K 
to  10K.  The  negative  sign  in  the  low-frequency  closed-loop  transfer 
function  results  from  the  design  configuration  and  implementation. 

Figure  6  shows  a  schematic  diagram  of  the  Resonance  Filter.  Voltage 
divider  networks  have  been  used  to  prevent  loading  and  to  avoid  emitter 
followers.  Direct  local  range  switching  is  used  at  low  impedance  levels. 

Two circuit additions introduce a minimum damping, and therefore a
minimum bandwidth sufficient to insure stability when bandwidth is set near
zero (highest Q) but not so large as to affect the desired range of
operation.


[Figure 6. Resonance Filter Schematic Diagram]


First, a bias-stabilizing feedback resistor in each integrating amplifier
modifies its transfer function from 1/(Ts) to 1/[T(s + 0.01/T)].
Since this is the only frequency-dependent portion, the entire transfer
function is modified from H_1(s) as given above to H_1(s + 0.01/T).
The effect of this translation in the complex frequency domain is
multiplication of the corresponding time function h_1(t) by an exponential
factor to produce h_1(t) e^{-0.01t/T}. The inverse transform of H_1(s)
yields the impulse response h_1(t), which has the form:

$$h_1(t) = \frac{a^2 + b^2}{a} \, e^{-bt} \sin at$$

Thus the effect of this modification on the dynamic response is to
increase b = πyF by an amount 0.01/T, so that y is effectively
increased by 0.01/(πFT). This has a net value of 0.0015 (10 turns) = 0.015
turn of dial offset.

The  second  circuit  addition  is  the  insertion  of  a  small  resistor  to 
ground  at  the  lower  end  of  each  bandwidth  potentiometer  section. 

This  raises  the  minimum  bandwidth  by  an  additional  amount  of  10t/200  = 
0.05  turn,  and  further  improves  the  built-in  dynamic  stability.  The 
total  offset  of  0.065  turn  is  calibrated  out  by  setting  the  dial  to 
read  0.065  turn  at  pot  electrical  zero.  This  corresponds  to  6.5  Hz 
minimum  achievable  bandwidth  on  the  low  range,  which  is  well  below  the 
20  Hz  minimum  bandwidth  setting  required. 
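
The offset arithmetic above can be checked with a short sketch. It assumes a ten-turn bandwidth dial (the "10t" of the text) and a low-range scale of 100 Hz per turn, which is implied by the statement that 0.065 turn corresponds to 6.5 Hz. The constant and function names are illustrative only and do not come from the report.

```python
# Assumptions (labelled; inferred from the text, not stated there as such):
#   - the bandwidth dial is a ten-turn dial ("10t")
#   - the low range reads 100 Hz per turn (0.065 turn <-> 6.5 Hz)
DIAL_TURNS = 10.0
LOW_RANGE_HZ_PER_TURN = 100.0


def bandwidth_offsets():
    """Return both offsets, their sum (turns), and the minimum bandwidth (Hz)."""
    # Offset from the bias-stabilizing resistors in the integrators.
    integrator_offset = 0.0015 * DIAL_TURNS
    # Offset from the small resistor to ground at each bandwidth pot section.
    resistor_offset = DIAL_TURNS / 200.0
    total = integrator_offset + resistor_offset
    min_bw_hz = total * LOW_RANGE_HZ_PER_TURN
    return integrator_offset, resistor_offset, total, min_bw_hz


if __name__ == "__main__":
    i_off, r_off, total, min_bw = bandwidth_offsets()
    print(f"integrator offset: {i_off:.3f} turn")
    print(f"resistor offset:   {r_off:.3f} turn")
    print(f"total offset:      {total:.3f} turn")
    print(f"minimum bandwidth: {min_bw:.1f} Hz (low range)")
    print("below the 20 Hz requirement:", min_bw < 20.0)
```

The two contributions are kept separate so that either one can be changed without touching the other.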

Adequate  bias  stabilization  of  the  integrating  amplifiers  is  provided 
by  local  feedback  when  the  gain  of  the  main  feedback  loop  is  low.  At 
the higher main loop feedback gains, however, tighter bias stabilization
is needed, which is accomplished using negative DC feedback around
the  main  loop.  Because  of  the  cascade  arrangement,  a  Burr-Brown 
Model  3019/15  low-input-current,  low-noise  amplifier  is  used  in  the 
first  integrator,  instead  of  the  709-type  amplifiers  used  elsewhere. 

Two  circuit  precautions  protect  the  stabilizing  signal  of  the  inner 
(bandwidth)  feedback  loop  from  being  attenuated  or  phase-shifted  in  the 
summing  amplifier  due  to  excessive  output  from  the  outer  (frequency) 
feedback  loop  under  overload  conditions.  The  frequency-signal  gain 
factor  KgCp  (See  Figure  5)  is  set  below  unity,  and  a  soft  limiter  is 
provided  at  the  output  of  Stage  F2.  These  precautions  prevent  large- 
signal  nonlinear  instabilities  that  might  otherwise  be  initiated  under 
transient  overload  conditions. 

The  ideal  response  function  for  the  Resonance  Filter  already  includes  a 
roll-off  above  the  tuning  frequency,  so  that  no  modifications  are 
needed for this purpose. All amplifier stages have closed-loop roll-off
corner frequencies greater than 100 KHz.




3.4  Resonance Filter Measured Response

The frequency dial calibration of the Resonance Filter was checked
at resonance with low bandwidth at various frequency settings on both
ranges by comparison with a frequency counter, and agreement well within
the ±2% specification was obtained.

The bandwidth dial calibration was checked quickly with moderate accuracy
by setting a fixed signal frequency and tuning both dials to obtain
unity gain and 90-degree phase shift on the oscilloscope. This condition
corresponds to a second-order network with 0.5 of critical damping
tuned to its natural (undamped) resonant frequency. The expressions
for this relationship are:

$$0.5 = \frac{b}{\sqrt{a^2 + b^2}}$$

$$2\pi f_s = \sqrt{a^2 + b^2}$$

When these are solved, the relations given previously yield the results
BW = fs and f = (√3/2) BW. The corresponding damped Q of √3/2 = 0.866
for this chosen test condition is slightly beyond the Q = 1 boundary of
the required tuning range, providing greater than normal sensitivity to
the bandwidth value.
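
The solution can be written out step by step. This is a sketch that assumes the relations given previously are b = π·BW and a = 2πf. That assumption is inferred from the stated results and is not quoted from the report.

```latex
% Assumed relations (inferred): b = \pi\,\mathrm{BW}, \quad a = 2\pi f
\begin{align*}
\frac{b}{\sqrt{a^2 + b^2}} = 0.5,\quad \sqrt{a^2 + b^2} = 2\pi f_s
  &\;\Rightarrow\; b = 0.5\,(2\pi f_s) = \pi f_s
   \;\Rightarrow\; \mathrm{BW} = \frac{b}{\pi} = f_s \\
a = \sqrt{(2\pi f_s)^2 - (\pi f_s)^2} = \sqrt{3}\,\pi f_s
  &\;\Rightarrow\; f = \frac{a}{2\pi} = \frac{\sqrt{3}}{2}\, f_s
   = \frac{\sqrt{3}}{2}\,\mathrm{BW} \\
Q = \frac{f}{\mathrm{BW}} &= \frac{\sqrt{3}}{2} \approx 0.866
\end{align*}
```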

The  measured  settings  shown  in  Table  3  differ  from  the  ideal  values  by 
less than the ±10% bandwidth calibration specification. Measurement
accuracy  is  limited  by  the  use  of  the  oscilloscope  face  for  the  relative 
gain  and  the  Lissajous  phase-shift  pattern. 


Table 3.  Resonance Filter Bandwidth Measurements

Signal Frequency   Dial   Measured Setting   Ideal Setting   Error
fs (Hz)                   (turns)            (turns)         (%)

2000               BW     1.96               2.00            -2
                   f      1.71               1.73            -1

1000               BW     9.20               10.0            -8
                   f      9.00               8.66            +4
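
The ideal dial settings follow from the test condition BW = fs and f = (√3/2) BW. The sketch below recomputes them and checks the two f-dial readings. It assumes dial scales of 1000 Hz per turn on the high range and 100 Hz per turn on the low range, inferred from the 10:1 range switching and from 0.065 turn corresponding to 6.5 Hz. The function names and the range labels are hypothetical.

```python
import math

# Assumed dial scales in Hz per turn (inferred, not stated in the report).
HZ_PER_TURN = {"low": 100.0, "high": 1000.0}


def ideal_settings(fs_hz, dial_range):
    """Ideal (BW, f) dial settings in turns for the 0.5-damping test condition."""
    bw_hz = fs_hz                          # BW = fs
    f_hz = bw_hz * math.sqrt(3.0) / 2.0    # f = (sqrt(3)/2) * BW
    scale = HZ_PER_TURN[dial_range]
    return bw_hz / scale, f_hz / scale


def percent_error(measured, ideal):
    """Signed error of a measured dial setting relative to the ideal one."""
    return 100.0 * (measured - ideal) / ideal


if __name__ == "__main__":
    # (signal frequency in Hz, range, measured f-dial setting in turns)
    for fs_hz, dial_range, f_measured in [(2000.0, "high", 1.71),
                                          (1000.0, "low", 9.00)]:
        bw_ideal, f_ideal = ideal_settings(fs_hz, dial_range)
        err = percent_error(f_measured, f_ideal)
        print(f"fs = {fs_hz:.0f} Hz: ideal BW = {bw_ideal:.2f} turns, "
              f"ideal f = {f_ideal:.2f} turns, f error = {err:+.0f}%")
```

The same `percent_error` function applies to the BW-dial rows once their measured settings are supplied.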



3.5  Resonance  Filter  Maintenance  and  Calibration 


The  remarks  in  Section  2.5  concerning  the  limited  circumstances  under 
which  recalibration  of  the  Null  Filter  might  be  required  also  apply 
to  the  Resonance  Filter. 

The following Factory Calibration Procedure utilizes a DC digital voltmeter
to set 10:1 attenuation ratios within 0.2% accuracy. AC gains
should be set to unity within 2% accuracy, utilizing either an analog
or digital AC voltmeter. Refer to the schematic of Figure 6.

Factory  Calibration  Procedure 

1.  Check alignment of electrical zero of each BW pot section
referred to its lower terminal, which corresponds to 0.065
turn dial reading. Check electrical zero of each FREQ pot section,
corresponding to zero dial reading.

2.  Check for unity gain magnitude from I1 output to I2 output, when
a 1000 Hz input signal is applied to the unit. Trim I2 input
resistor or feedback capacitor if necessary.

3.  Check for unity gain from I2 output to B1 output with 50 Hz input
signal and BW set at 2 KHz. Adjust B1 potentiometer as needed.

4.  Check for unity gain from B1 output to B2 output with BW set at
4.54 KHz. Trim B2 feedback resistor if necessary.

5.  Check that the transmission from I2 output to the high end of
the B1 section BW pot falls by 10:1 with low BW range switching.
Remove I2 and apply DC voltage with BW and f at minimum dial
settings. Trim 150 ohm resistor if necessary.

6.  Check that the transmission from B1 output to the high end of
the B2 section BW pot falls by 10:1 with low BW range switching.
Remove B1 and apply DC voltage. Trim 120 ohm resistor if
necessary.

7.  Apply 4000 Hz input signal, set BW to minimum and f to 4 KHz
(high range), and vary f dial to find peak response of unit.
Adjust F2 potentiometer to obtain f = 4 KHz dial setting for
peak response.

8.  Apply 900 Hz input signal with BW set to minimum, set f to
900 Hz (low range), and vary f dial to find peak response.
Trim 1yd ohm resistor at F1 if necessary to obtain f = 900 Hz
dial setting for peak response.




DOCUMENT CONTROL DATA - R & D

(Security classification of title, body of abstract and indexing
annotation must be entered when the overall report is classified)

1.  ORIGINATING ACTIVITY (Corporate author)
    SIGNATRON, Inc.
    594 Marrett Road

2a. REPORT SECURITY CLASSIFICATION
    Unclassified

2b. GROUP

3.  REPORT TITLE
    INVESTIGATION OF FACTORS AFFECTING
    THE QUALITY OF VOCODER SPEECH

4.  DESCRIPTIVE NOTES (Type of report and inclusive dates)
    Final - April 15, 1967 through May 17, 1969 (Approved 17 June 1969)

5.  AUTHOR(S) (First name, middle initial, last name)
    Thomas H. Crystal

6.  REPORT DATE
    May 7, 1969

7a. TOTAL NO. OF PAGES
    78

7b. NO. OF REFS
    13

8a. CONTRACT OR GRANT NO.
    F19628-67-C-0292

 b. PROJECT NO., Task, Work Unit Nos.
    4610-02-01

 c. DoD Element: 62405454

 d. DoD Subelement: 674610

9a. ORIGINATOR'S REPORT NUMBER(S)
    SIG-CR-594

9b. OTHER REPORT NO(S) (Any other numbers that may be assigned
    this report)
    AFCRL-69-0256

10. DISTRIBUTION STATEMENT
    Distribution of this document is unlimited. It may be released to the
    Clearinghouse, Department of Commerce, for sale to the general public.

11. SUPPLEMENTARY NOTES
    TECH, OTHER

12. SPONSORING MILITARY ACTIVITY
    Air Force Cambridge Research
    Laboratories (CRB)
    L. G. Hanscom Field
    Bedford, Massachusetts 01730

13. ABSTRACT

Research into and the development of instrumentation for the
investigation of factors affecting the quality of vocoded speech
are documented. The work reported was specifically concerned with
developing a better understanding of the role of the vocal source
in the production of both synthetic speech and of natural speech.

The design of and operating instructions for the VOTIF vocal tract
inverse filter - built as part of the program - are presented. A
theoretical determination of the interaction between the vocal source
and vocoder channel filters has been made and the effect of spectrum
flattening on the peak factor and power of a vocoder channel has
been computed. Lastly, the pulsed excitation of resonances is discussed.
A form of pitch jitter which could either maximize vocal
output or minimize vocal tract impedance effects is reported on.


DD FORM 1473, 1 NOV 65

Unclassified
Security Classification




SIGNATRON, Inc.

research and consulting

MILLER BUILDING, 594 MARRETT ROAD, LEXINGTON, MASSACHUSETTS 02173


ERRATA  SHEET 


AFCRL-69-0256

INVESTIGATION  OF  FACTORS  AFFECTING 
THE  QUALITY  OF  VOCODER  SPEECH 

May  17,  1969 


page  3-8  In  the  first  line  of  Eq.  (3.13a), 

A  A, 

;  p  =  n  should  be  ;  p  ^  n 

page  3-8  In  the  first  line  of  Eq.  (3.13b), 

;  p  =  n  should  be  ;  p  ^  n 

Next  to  last  page  (Form  1473),  item  6,  Report  Date 

May  7,  1969  should  be  May  17,  1969. 

Last  page  (reverse  of  Form  1473),  item  14,  Key  Words 

should  be  added  as  follows: 

Inverse  Filter 

Instrumentation  Speech  Analysis 
Speech 

Speech  Production 
Vocal  Source 

Vocal  Tract  Inverse  Filter 
Vocoder 

Vocoder  Excitation 
Vocoder  Filter 
Vocoder  Quality 


Ref: AD 690179


